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Transient protein protein interactions are also involved in the 

INTRODUCTION recruitment and assembly of the transcription complex to spe- 

. „ cific promoters, the transport of proteins across membranes, 

Protein-orotein interactions are intrinsic to virtually every ^ Qf na;jve proteins cata l y zed by chaperonms, indi- 

cellular process. Any listing of major research top.es .n b.o.c,- — - ste p S of lhe trans i at ion cycle, and the breakdown ana 

ev-for example, DNA replication, transection translation forma ,; on of subcellular structures during the cell cycle 

splicing, secretion, cell cycle control, signal transduction, and ^ ^ , asmic microt ubules, the spindle apparatus, 

intermediary metabolism-is also a listing of proc esses in v , ear , amina and the nuclear pore complex). Transient com- 

which protein complexes have been cated as «ff ntl J , exes are much more difficult to study, because the proteins or 

components. In consequence, the analysis of the P^teins in P responsible for the transient reaction have to be 

these complexes is no longer the exclusive domain ot biocnem of this review 1S to descrlbe 

ists; geneticists, cell biologists, developmental biotogij t , mo- dentin ^ first devdQ e that have allo wed the.r 

lecular biologists, and biophysicists have by necessity a 1 gotten recent ^ m characteriza ,ion. 

into the act We attempt in this review to summa n« both .dentihcat.on^^^ .^^.^ ^ ^ a number of different 
classical and recent methods to identify proteins that interact measurab ,^ effects . Firsti the y can alter the kinetic properties 
and to assess the strengths of these interactions. Drote ins This can be reflected in altered binding of sub- 
Proteins that are composed of more than one subunit are of prote , . ^ ^ enuncjated by Monod et ,. 
found in many different classes of proteins. Some of the best- si rates, „ , cric prope rties of the complex. Thus, the 
characterized multisubunit proteins are those that, as ^original y 3 i)*^* ^f!^ nuclear antigen with DNA 
purified, contained two or more different ™ m f™™£™*? w mera se 5 alters the process.vity of the polymerase (174), 
include classical proteins such « h «moglob.n tr^tophan s^ £* nteraction of succinate thiokinase and a-ketoglutarate de- 
thetase, aspartate transcarbamylase core R N A polymerase. 1 , n K for succiny i coenzyme A by 30-fold 
QP-replicase, and glycyl-tRNA synthetase. Since these pro- £ binding of oxygen to hemoglobin 
teins purified as multisubunit complexes, the.r protein-protein ^^^^ re P gu i ation 0 f aspartate transcarbamylase are 
interactions were self-evident. in Hude reeulated by interactions of the protomers. Second, protein- 
Other well-known examples of multisubunit prote'n^nclude regulate Y ^ Qne common mechanism to allow for 
much more complicated assemblies of P 0 ^^^??^ E rate channeling. The paradigm for this type of complex is 
elude metabolic enzymes such as the pyruvate dehydrogenase pirate B ^ ^ Neuwspora crassa _ It a complex 
and a-ketoglutarate dehydrogenase complexes, the DNA rep- uyp^ p I f which out one of the two steps 
lication complex of Escherichia coli and other organisms he of two subun ^ ^ ^ 3 _ glycer0 , p ^ s . 
bacterial flagellar apparatus, the nuclear pore ^P'ex. »nd the o cQnversion of indoIe to tryptophan). The 
tail assembly of bacteriophage T4. Also inc luded . m th. group phat J . $ noncovalently bound> but „ IS preferen- 
are nbonucleoprotein complexes, such as he : s.gna I rec ogn. nte ^ ^ phan (241) . Many slrnlla r exam- 
tion particle of the glycosylat.on pathway, small nudear .bo i y . channeling have been demonstrated both 
nucleoproteins of the spliceosome and the nbosome itself. ptesot q{ & and between dlffer ent 
Although some of the subunits of these protein complexes are oe multifunctional polypeptide <see reference 
not tightly bound, activity is associated with a large structure aom review f Third , protein-protein interactions can result 
that in many cases is called a protein machine (5). format ion of a new binding site. Thus, an ADP site forms 
There are also a large number of transient protein-pro te.n in ™ ™ Qf the a and p subunits of Escherichia colt 
interactions, which in turn control a large number of cellular a . t hexokinasc binds one ATP molecule at 
processes. All modifications of proteins necessarily involve r, J f > > e a mmetric homodimer (209). and phos- 
such transient protein-protein interactions. These include the Uk ^.n u from slearothermophi !us binds both 
.nteractions of protein kinases, protein phosphatase g >cos>l pn g and ADp a , , he interface bcrween sub - 
transferases, acyl transferases, proteases, etc., with the r suiv ^ protein-protein interactions can inactivate a 
strate proteins. Such protein-modifying enzymes en^.. - • , . & ^ ^ wi , h the interaction of phage P^ 
laree number of protein-protein interact.ons in the eel, and r _ , amj SSQr (213)i with thc interaction of 
regulate all manner of fundamental processes such a cc cp .^.^ and whh the interactl0n ot 
growth, cell evele, metabolic pathways, and signal transduction. mp 
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olricc 17 gene 1.2 protein with /•- coU dGTP triphosphohy- 
dmlasc (1S6). Fifth protein-protein interactions can change 
fhc tpecLitv of a protein for its substrate; thus, the interacuon 
of lactalbumin with lactose synthase lowers the K„ fo glucose 
bv 1,000-fold (95), and the interaction of transcription factors 
with RNA polymerase directs the polymerase to different pro- 

"kJoU et al. (116) enumerated four advantages of mu'Usub- 
unit proteins relative to a single large protein with multiple 
sUes S t ^ much more economical to build proteins from 
impler subunits than to require multiple * ^Sta 

information to synthesize oligomers. Thus, for examp e act.n 
filaments and virus coats are much more «mply ambled 
from monomers than by translation of a large PolyP^.n of 
repeated domains. Similarly, it is much more convenient to 
ha^e one gene encoding a protein ^with different interacting 
-a^tn-'s such as some of the eukaryotic RNA polymerase 
s K ubuni'ts; than to have the gene for that subunit reiterated - 
each different polymerase. Second, translation of ^la ge ^ pro- 
teins can cause a significant increase in errors in trans lation. 
such errors cause a lack of activity, they are much _more eco 
„„~;,-.,ii v elimin^tpd bv preventing assembly of that subunit 
'intoth'e'complex than by eliminating the whole protein 1 h.rd, 
multisubunit assemblies allow for synthesis at one locale fol 
lowed by diffusion and assembly at another locale, this a lows 
for both faster diffusion (since the monomers are smaller) and 
compartmentalization of activity (if * ssemb V, h Th aV e ^n 
activity). Fourth, homooligomer.c prote.ns, if they have an 
advantage over monomers, are easily selected ,n evo ution _ f 
the oligomers interact in an an.iparallel arrangement; in th s 
case, a single-amino-acid change that increases interaction po- 
tential has effects al two such sites. 

Another advantage of multisubunit complexes ,s the ability 
to use different combinations of subunits to alter the magni- 
tude or type of response. Thus, for example, adult hemoglobin 
(«202) an P d fetal hemoglobin (a2y2) are each composed of 
heicrooligomers with a common a subunit; differences in the 
binding of oxygen in these hemoglobins allow oxygen to be 
read 1> passed from mother to fetus. Other examples include 
the o.igomerization of Jun with Fos or with itself, which Results 
in distmct activities in transcription because the different 
dinners bend DNA in opposite directions (114); the mterjcuon 
of TATA-binding protein with the transcription apparatus ot 
RNA polymerase' I P II, or III, in which TATA-b.nding prot in 
plays different roles (235); the '"leracuonsof microtubule 
with the large set of proteins to which they bind (1 T3) no al 
of which bind at the same time; the interaction of different 
transcription factors with core RNA polymerase in .both eu- 
karvotes and prokaryotes to direct transcription of different 
genes anu ^interaction of retinoblastoma (Rb protcm with 
viral oncoproteins and other cellular prote.ns (31, 32) 

Projein-protein interactions may be mediated at one ex- 
treme by a small region of one protein fitting into a clef in 
another protein and at another extreme by two surfaces inter- 
acting over a large area. Examples of the first case include the 
large class of protein-protein interactions that involve a do- 
main of a protein interacting tightly with a smaU P«^e_Th c 
paradigm for this type of interaction is that of specific Src 
homology 2 (SH2) domains with specific small peptides con- 
tain ngTphosphotyrosyl residue. This interacuon occurs with 
a dissociation constant as low as nM and ,s due to ; a specie 
hindin P pocket in SH2 domains no. unlike a classical substrate 
binding pocket (64. 205, 224, 225). Many other examples o, 
domain/that bind small peptides with affinities in the nano- 
molar to molar range have been described. The paradigm i fo 
the second case. i.e.. surfaces that interact with each other over 
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hrgc areas is that of the leucine zipper, in which a stretch oi 
a-hehx forms a surface that fits almost perfectly with anothci 
u-helix from another subunit protein (59, 161; also see refer 
crlce 4) Binding also occurs in the nanomolar range for such 
interactions (196). Other interactions may occur through in- 
termediate-sized complementary surfaces. 

It is evident that protein-protein interactions are much more 
widespread than once suspected, and the degree of regulation 
that they confer is large. To properly understand their signif- 
icance in the cell, one needs to identify the different interac- 
tions, understand the extent to which they take place in the 
cell and determine the consequences of the interaction. This 
review is intended to supply an overview of three aspects ot 
protein-protein interactions. First, we briefly describe a num- 
ber of physical, molecular biological, and genetic approaches 
that have been used to detect protein-protein interactions 
Second we describe several experimental approaches that 
have been used to evaluate the strength of protein-protein 
interactions. Third, we describe three well-characterized do- 
mains that are responsible for protein-protein interactions m a 
number of different proteins. As the literature on this topic * 
vast we have not attempted to conduct an exhaustive review 
Rathei, we hope that this article serves as a journeyman > 
euide to protein-protein interactions. 

The first and still the most comprehensive review on protein 
protein interactions is that of Klotz et al. ( 1 16) This review 
contains a survey of the subunit composition and binding en 
ereies of all oligomeric proteins that had been identified at the 
time as well as a discussion of the geometry of interactions anc 
an excellent discussion of the influence of binding constants 
concentrations, and cooperativity parameters on the popula 
tion of oligomers. A good discussion of channeling and com 
partmentation is found in the monograph by Fnednch or 
quaternary structure (70) and the article by Srere (208). 
review by Eisenstein and Schachman (57) contains an interest 
in* discussion of the functional roles of subunits of oligomerK 
proteins and of approaches used to determine whether the 
monomers of oligomeric proteins are active. Also of interest v. 
the discussion of proteins as machines (5) and a discussion o 
protein size and composition (78). 

PHYSICAL METHODS TO SELECT AND DETECT 
PROTEINS THAT BIND ANOTHER PROTEIN 

Protein Affinity Chromatography 

A protein can be covaiently coupled to a matrix such a 
Sepharose under controlled conditions and used to select i 
eand proteins that bind and are retained from an appropriate 
extract Most proteins pass through such columns or an 
readilv washed off under low-salt conditions; proteins that an 
retained can then be eiuted by high-salt ^ ut «L^ 
chaotropic solvents, or sodium dodecyl sulfate (SDS) (big. lj 
If the extract is labeled in vivo before the experiment, there ar< 
two distinct advantages: labeled proteins can be detected wit, 
high sensitivity, and unlabeled polypeptides derived from th 
covaiently bound protein can be ignored (these might be eithe 
proteolytic fragments of the covaiently bound protein or sut 
units of the protein which are not themselves covalentl 
bound) This method was first used 20 years ago to detec 
phage and host proteins that interacted with different forms c 
E coli RNA polymerase (177). Proteins that were retained b 
an RNA polymerase-agarose column (which was shown to b 
enzvmaucally active) but not by a control column coupled wit 
bovine serum albumin were judged as interacting candidate 
The interactions were substantiated in two ways, hirst, tn 
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FIG. 1. Protein affinity chromatography. Extract proteins are passed over a 
column containing immobilized protein. Proteins that do not bind flow through 
the column, and ligand proteins that bind are retained. Strongly retained pro- 
teins have more contacts with the immobilized protein than do those that are 
weakly retained. 



interaction of T7 0.3 protein with RNA polymerase was con- 
firmed by coimmunoprecipitation of the 03 protein with RNA 
polymerase antibody. Second, the interaction of T4 proteins 
with RNA polymerase was shown to depend on the form of 
RNA polymerase on the column: one T4 protein interacted 
with core RNA polymerase and T4-modified RNA polymerase 
but not with RNA polymerase holoenzyme, and another inter- 
acted only with the T4-modified polymerase. The phage pro- 
teins that bound RNA polymerase were identified by their 
absence in appropriate T4 and T7 mutants. 

Similar methods have been used, particularly by the labora- 
tories of J. Greenblatt and B. Alberts, to identify many other 
protein-protein interactions. Two excellent reviews on the 
topic, which cover many of the details of coupling and a num- 
ber of strategic considerations, have been published (69, 145). 

Candidate proteins can be coupled directly to commercially 
available preactivated resins as described by Formosa et al. 
(69). Alternatively, they can be tethered noncovalently through 
high-affinity binding interactions. Thus, Beeckmans and Ka- 
narek (14) demonstrated an interaction between fumarase and 
malate dehydrogenase by immobilizing the test enzyme with 
antibody bound to protein A-Sepharose, as well as by direct 
covalent coupling of the test enzyme to Sepharose. Some of the 
important considerations of a successful binding experiment 
are elaborated below. 

Purity of the coupled protein and use of protein fusions. An 
essential requirement for a successful protein affinity chroma- 
tography experiment is pure protein; otherwise, any interacting 
protein that is detected might be binding to a contaminant in 
the preparation. Greenblatt and Li (80) did two experiments to 
establish that core RNA polymerase bound to NusA on the 
column rather than to a contaminant in the NusA preparation. 
First, they demonstrated that a fully active NusA variant pro- 
tein, which presumably contained different amounts of various 



contaminants (since it elutcd at different positions in columns 
used to purify it), still bound core RNA polymerase; second, 
they demonstrated by independent experiments that the com- 
plex contained equimolar amounts of NusA protein and core 
RNA polymerase. 

The easiest way to obtain pure protein, if the gene is avail- 
able, is through the use of protein fusions. Several such systems 
have been described; in each case, the protein of interest (or a 
domain of the protein) is fused to a protein or a domain that 
can be rapidly purified on the appropriate affinity resin. The 
most common such fusion contains glutathione 5-transferase 
(GST), which can be purified on glutathione-agarose columns 
(202). Other fusions in common use include Staphylococcus 
protein A, which can be purified on columns bearing immu- 
noglobulin G; oligohistidine-containing peptides, which can be 
purified on columns bearing Ni 2+ ; the maltose-binding pro- 
tein, which can be purified on resins containing amylose; and 
dihydrofolate reductase, which can be purified on methotrex- 
ate columns. (Other common protein fusions which add an 
epitope for the influenza virus hemagglutinin [12CA5] or c- 
Myc are also in common use and are used most often for 
coimmunoprecipitation [see the section on immunoprecipita- 
tion, below].) 

Purified fusion proteins are used in two ways to detect in- 
teractions on affinity columns. First, the protein is covalently 
coupled to the resins in the usual way, as was done by Mayer 
et al. (139) to detect a tyrosine-phosphorylated protein that 
bound to the SH2 domain of Abl tyrosine kinase and by Weng 
et ai. (232) to demonstrate that the SH3 domain of c-Src binds 
paxillin. Second, the purified fusion proteins can be nonco- 
valently bound to the beads and then mixed with an appropri- 
ate extract or protein. This was done by Zhang et al. (248) to 
demonstrate an interaction of the N-terminal portion of c-Raf 
with Ras, by Flynn et al. (68) to detect the binding of an actin 
filament-associated protein to Src-SH3/SH2, and by Hu et al. 
(99) to demonstrate the binding of the SH2 domain of the p85 
subunit of phosphatidylinositol 3-kinase to two different 
growth factor receptors. 

Influence of modification state. The interactions of many 
proteins with their target proteins often depends on the mod- 
ification state of one or both of the proteins (mostly by phos- 
phorylation). Thus, the recognition of Rb protein by the tran- 
scription factor E2F and by the transforming proteins simian 
virus 40 large T antigen, human papillomavirus- 16 E7, and 
adenovirus E1A is more efficient with underphosphorylated 
than phosphorylated Rb (132, 133, 240). Conversely, SH2 do- 
mains of proteins, for example, recognize tyrosine phosphory- 
lated substrates several orders of magnitude more efficiently 
than they do their nonphosphorylated counterparts (64). Pro- 
tein-protein interactions that require a posttranslationally 
modified protein for interaction are not detected if the protein 
is purified by the use of expression vectors in cells in which the 
protein is not properly modified. A means to circumvent this 
problem is to use GST fusion vectors to express proteins in 
host cells more related to their origin. Thus, the interaction of 
bovine papillomavirus E5 oncoprotein with an a-adaptin-like 
molecule was confirmed by addition of beads to extracts of 
NIH 3T3 cells that were expressing the GST-E5 fusion (38). 
Similarly, a yeast GST vector that allows regulated expression 
of yeast GST fusion proteins has been described (148). 

Retention of native structure of the coupled protein. Failure 
to detect an interacting protein can result from inactivation of 
the protein during coupling. Ideally, coupling would immobi- 
lize a protein or a complex by randomly tethering it to the 
matrix through one covalent bond. For example, binding of E. 
coli proteins to immobilized \ N protein occurred only when 
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i \ (( ^MHrVactivatcd residues on the ma- 

Icrniiiung tol the coupled pro.e.n has " 

urnS the desired polymerized form was stabilized w, pha - 
loidin (or by chemical cross-linking), and the prote ns ma 
bound F- act in were shown not to bind monomer.c act in 1£ 
Similarly, microtubule columns were stabilized with taxol 

°!J„ ) t :.. e oro ,„in structure also depends on all subunits of a 
-S? b P ein g present in 

oTthe submit C c ^os^of" t Su'ted materia, with^hat of 
leaning material In the case of £ ccfi RNA po ymerase, .1 
... . „ r , n ^« nl c of tbp pnzvme were still present (17/). in me 
Si RNA polymerase II. one of the subunits 
HiH nnt renroduciblv remain after coupling (^UO). 

Con en« P rat?on oVthe coupled protein. To detect interactions 
efiSly, the concentration of protein covalently bound to the 
co mn has to be well above the K„ of the interaction. Thus, for 
the detecdon of weak protein-protein interacts, the concen- 
ration of b 0U nd protein should be as high as poss.be Weak 
nteracUons can b P e completely missed on columns with lower 
of coupled protein, even if they contain corre 
soondinelv larger amounts of resin to maintain the same total 
P IS boSnd protein (see the sections or , impor Urn e o 
characterization of the binding interaction and on binding 
immobilized proteins, below, for a discussion of this point) 
"Trnount „f P extract applied. The amount of extract apphe I to 
the column can be crit.cal for two opposing reasons. If too 1. tie 
ettrac "applied and the protein that binds is present at _low 
concentration, too little protein will t 
even if it binds with high affinity and is labeled with S (see tor 
, c -mM Pnnverselv if too much protein is 

and Alberts (144) in looking for minor protein species that 

^mhrc'oLrdtraUons. There are four distinc, ^advantages of 
protein affinity chromatography as a technique for detecting 
protein-protein interactions. First, and most important pro- 
Tein affinity chromatography is incredib ly ««mve. « a ; 
nmnriate use (high concentrations of immobilized test pro 
P ein) it can detecfinteractions with a binding constant as weak 
as 10 5 M (69) (see the section on binding to immobilized 
pro tcin below) This limit is within range of the weakest ,n- 
tera ion likely to be physiologically relevant, which we esti- 
mate to be in tL range'of 10"* M (see the ^<^™£° 
bindinc-constant considerations, below). Second '"is tech 
niauc "tests all proteins in an extract equally, thus extract 
proteins tha, aredetected have successfully competed for he 
fest protein with the rest of the population of proteins. Third 
Us easv ,o examine both the domains of a protein and the 
ntl-Tre'due, within it that are responsible for a spec, c 
interaction bv preparing mutant derivatives (38 21b) Four . 
interactions that depend on a multisubunit tetnereu pro..,, 
ran be detected unlike the case with protein blotting 

One pS,a problem derives from the very sensitivity of 
the technique. Since it detects interactions that are so weak. 



ipj.-pcndcnt criteria must be used to establish that the m/.c. 
actn is physiologically relevant. Detection o, I a la.se -p«s. - 
m..ivi1 cm arise for a number of other reasons. First, the pro 
^ may b nd the test protein because of charge interact,,^; 
l^-this^ason it is desirable to - a « ' column with 

^Z^^Ts^^^^ with the 
Zl'pZZ although interesting in itself, interaction may 
not be direct. Third, the proteins may interact with high spec 
"ficitv even though they never encounter one another in the 
cell. The most famous example of this type is the high affinity 
of actin for DNase I (125). 

For all of these reasons, the prudent course is to indepen- 
dent demonstrate the interaction in vitro or, if possible, in 
%o Cosedimentation was used to confirm the interaction of 
RAP 72 (now known as RAP 74) and RAP 30 with RNA 
polymerase „ (206), NusA protein with core RN A po ymerase 
r S f)< and NusB protein with nbosomal protein S10 (138). In 
other case; more biological criteria were used. For examp c 
antibodies were generated against many of the pro em that 
interacted with F-actin (but not monomer.c G-aclu on .o. 
umns and these were used to demonstrate that more than 90 /. 
T" J _„-^„ ndin „ proteins were localized with an act.n-l.ke 
d stribuTioXinS ^mitosis of Drosoplula embryos at the syn- 
c^ a blastoderm stage of development (.44). The identifica- 
tion of three yeast actin-binding proteins was confirmed in 
three separateways: one of the proteins was shown to corre- 
nond to the yeast analog of myosin by virtue of a shared 
Soe another protein colocalized with actin cables and cor- 
tfc l aciin patchel and overproduction of the third protein 
caused a reorganization of the actin cytoskeleton (53). In the 
tailof microtubule-associated proteins, two criteria 
were used to demonstrate the authenticity of the results (113). 
Fh-st antibodies for 20 of the 24 candidate microtubu e-asso- 
c ated proteins stain various parts of microtubule structure o 
DroToSTa embryos during the cell cycle. Second many (but 
not alO , oi the mTcrotubule'-associated protein. .isolated on mi- 
crotubule affinity columns are the -me as those .so U ted by 
traditional cosedimentation methods of Vallee and Collins 



^Failure to detect an interaction can occur for a number ot 
techS masons, described above. A false-negative result can 
an e fo W additional reasons: the interacting protein may 
nn? he able to exchange with another protein to which it is 
SSdSg or the two h pro 8 teins may not be able to interact both 
with each other and with, the resin. . 

Protein affinity chromatography does not always yield an- 
swer corresponding to other approaches. For reasons that are 
S 1 1 ^ number of proteins were detected by probing 
sns nolvacrvlamidc gel electrophoresis (PAGE) gels with a 
GST fu fon o the SH2 domain of Abl tyrosine kinase, but only 
a couolc of proteins were detected on columns coupled with 
th s Srmein 039). Similarly, a specific protein was detected on 
F -X columns stabilized by subcrimidate cross-l.nk.ng but 
, rZlloidin (144') Finally, G-actin interacting proteins 
ar very d5n cu t to detect with columns of G-actin although 
Teh columns bind DNase I; by contrast, DNase I columns can 
be used to detect such G-actin interactions (24). 

Affinity Blotting 

In a procedure analogous to the use of affinity columns, 
J P -f k fractionated by PAGE transferred to a n.t o- 
Slo"c membrane, and identified by their ability - lobind a 
pm ein peptide, or other ligand. This method is similar u 
CunoWottine (Western blotting), which uses an antibod> a. 
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the prohc. Complex mixtures of proteins, such as total-cell 
lysates can be analyzed without any purification. Therefore, 
this method has been particularly useful for membrane pro- 
teins such as eel! surface receptors (see reference 207 for a 
discussion). Cell lysates can also be fractionated before gel 
electrophoresis to increase the sensitivity of the method for 
detecting interaction with rare proteins. 

Considerations in affinity blotting include the biological ac- 
tivity of the proteins on the membrane, the preparation of the 
protein probe, and the method of detection. Denaturing gels, 
which are run in the presence of SDS and sulfhydryl reducing 
agents will inactivate most proteins and separate subunits of a 
complex. These denaturants are removed during the blotting 
procedure, which allows many proteins to recover (or partially 
recover) activity. However, if biological activity is not recover- 
able the proteins can be fractionated by a nondenatunng gel 
system. This method eliminates the problem of regeneration of 
activity and allows the detection of binding m cases when 
binding requires the presence of a protein complex. 

The protein probe can be prepared by any one of several 
procedures, and, as with affinity columns, the recent advent of 
fusing tags to the protein has greatly facilitated this purifica- 
tion. Synthesis in E. colt with a GST fusion, epitope tag, or 
other affinity tag is most commonly used. The protein of inter- 
est can then be radioactively labeled, biotinylated, or used in 
the blotting procedure as an unlabeled probe that is detected 
by a specific antibody. Vectors that incorporate into the pro- 
tein a short amino acid sequence recognized by the heart 
muscle cyclic AMP (cAMP)-dependent protein kinase provide 
another convenient means for in vitro labeling (18). 

One example of affinity blotting is the study of calmoduhn- 
binding proteins (77). Calmodulin can be 125 I labeled and used 
either to probe a gel strip directly or to probe a nitrocellulose 
membrane after transfer of fractionated proteins. Because the 
extent of rcnaturation of calmodulin-binding proteins is vari- 
able the assay is not quantitative. False-positive results have 
been detected in which a basic sequence binds calmodulin, 
although generally this binding is Ca 2+ independent. A major 
advantage of this technique is that in the analysis of a multi- 
meric protein that binds calmodulin, the precise binding 
polypeptide can be readily identified by affinity blotting with 
calmodulin. Using a combination of genetic approaches, 
Geiser et ai. (73) identified the spindle pole body component 
SpcllO (Nufl) as interacting with yeast calmodulin and then 
used affinity blotting to demonstrate that labeled calmodulin 
could directly detect a GST-Spcl 10 fusion transferred to a blot 
after fractionation by SDS-PAGE. 

Affinity blotting has been widely used in studies of the as- 
sociation of the regulatory' subunit of the type II cAMP-de- 
pendent protein kinase with numerous specific anchoring pro- 
teins (reviewed in reference 29). Two-dimensional procedures 
of isoelectric focusing followed by SDS-PAGE have been used 
to increase the separation of these anchoring proteins. As a 
control in some of these experiments, a mutant of the regula- 
tory subunit that is deleted for the first 23 residues did not 
detect any anchoring proteins. 

Immunoprecipitation 

Coimmunoprecipitation is a classical method of detecting 
protein-protein interactions and has been used in literally 
thousands of experiments. The basic experiment is simple. Cell 
l"satcs are generated, antibody is added, the antigen is precip- 
itatecTand washed, and bound proteins are eluted and ana- 
lyzed Several sources of material are in wide use. The antigen 
used to make the anybody can be purified protein (either from 
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the natural tissue or organism or purified after expression in 
another organism) or synthetic peptide coupled to carrier, and 
the antibody can be polyclonal or monoclonal. Alternatively, 
the protein can carry an epitope tag for which commercially 
available antibodies are available (12CA5 and c-Myc are in 
common use) or a protein tag (such as GST) for which beads 
are available to rapidly purify the GST fusion protein and any 
copurifying proteins. Glutathione-agarose beads were used, for 
example, to detect and characterize a GTP-dependent inter- 
action of Ras and Raf (227) and to demonstrate that the v-Crk 
SH2 domain binds the phosphorylated form of paxillin (16). 
The GST fusion immunoprecipitates a 70-kDa protein that 
reacts with anti-paxillin antibody and with anti-phosphoty- 
rosine antibody; moreover, anti-paxillin immunoprecipitates a 
protein that reacts with anti-Crk antibody but only under con- 
ditions when the paxillin is phosphorylated. 

Several criteria are used to substantiate the authenticity of a 
coimmunoprecipitation experiment. First, it has to be estab- 
lished that the coprecipitated protein is precipitated by the 
antibody itself and not by a contaminating antibody in the 
preparation. This problem is avoided by the use of monoclonal 
antibodies. Polyclonal antibodies are usually preadsorbed 
aeainst extracts lacking the protein to remove contaminants or 
are prepurified with authentic antigen. Peptide-derived anti- 
sera (which are usually made by coupling of the peptide to a 
carrier protein) are usually preadsorbed against the protein 
that was coupled, to remove antibody against the carrier, m 
addition to the usual purification to remove contaminating 
antibody. Second, it has to be established that the antibody 
does not itself recognize the coprecipitated protein. This can 
be accomplished by demonstrating persistence of coprecipita- 
tion with independently derived antibodies, ideally with spec- 
ificities toward different parts of the protein. Alternatively, it 
can sometimes be demonstrated that coprccipilation requires 
the presence of the antigen; cell lines, growth conditions, or 
strains that lack the protein cannot coprecipitate the protein 
unless the antigen is added. In certain cases, it can also be 
shown that antibody generated against the coprecipitated pro- 
tein will coprecipitate the original antigen. Third, one would 
like to determine if the interaction is direct or proceds through 
another protein that contacts both the antigen and the copre- 
cipitated protein. This is usually addressed with purified pro- 
teins, by immunological or other techniques. Fourth and most 
difficult, is determining that the interaction lakes place in the 
cell and not as a consequence of cell lysis. Such proteins ought 
to colocalize, or mutants ought to affect the same process. 

A particularly good example of this technique is the dem- 
onstration that adenovirus E1A protein interacts with Rb pro- 
tein A mixture of monoclonal antibodies against E1A coim- 
munoprecipitated a discrete set of five polypeptides (and some 
smaller ones) from a cell line expressing E1A, including a 
narticularly abundant one of 110 kDa (84). Four lines of evi- 
dence supported the claim that the 110-kDa polypeptide was 
forming a complex with El A protein. First, coprecipitation was 
not specific to a single antibody; three independent monoclo- 
nal antibodies against El A protein coimmunoprecipitated this 
protein. Second, these antibodies did not themselves recognize 
or immunoprecipitate the native or denatured 110-kDa pro- 
tein although they recognized and immunoprecipitated native 
and ' denatured El A protein. Third, coprecipitation required 
El A protein; the 110-kDa polypeptide could be immunopre- 
cipitated from HeLa extracts (which do not contain El A pro- 
tein) only if a source of E1A protein was added. Fourth, the 
complex could be detected independently m crude lysates; a 
subpopulation of El A protein in lysed cells sedimented at 10S 
rather than at 4S, and this subpopulation contained coimmu- 
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noprec.pitabie 110-kDa protein. A similar 11 0-kl a p ten ; 
well as a similar set of other protc.ns) was also .dent.f.cd w 
antipeptide antisera against E1A prolc.n 242) I wo so par. tc 
ami era (one against an amino-termiP.al pcpt.d and 
against a carhoxvl-terminal pept.de) each coprecipitated the 
1 10-kDa polypeptide, and coprecipitation was prevented e.tner 
with an excess of the corresponding El A peptide antigen or .n 
cell extracts lacking El A protein. - . ■ 

Subsequent studies established that this 105- _to 10-kDa 
polypeptide was the Rb gene product (236). To this end, 
rno^oclona. antibodies against the 110-kDa protein ^ere pre- 
pared bv immune purification of the 110-kDa pro te.n The 
resulting antibody coprecipitated El A protein, just as ng- 
E1A coprecipitated the 110-kDa protein. Since the 110-kDa 
protein was the same size as Rb protein, and since it was 
present in a wide variety of cell lines but not in ce 1 _ line known 
to contain deletions of the Rb gene it seemed likely that he 
110-kDa protein was Rb protein. This was proved by using 
anti-Rb peptide antibodies against different regions oT Rb m 
three experiments. First, 110-kDa protein precipitated with 
anti-1 10-kDa antibody comigrated and had the same partial 
peptide map as that precipitated with anti-Rb annbody Sec- 
ond, 110-kDa protein precipitated will. i A n ™" f-> 
could be detected in immunoblots with two different ant.-Rb 
antibodies, and this detection was inhibited by the correspond- 
ing peptide antigen. Third, anti-1 10-kDa antibody could ,m- 
munoprecipitate Rb protein synthesized in vitro. 

When coimmunoprecipitation is performed with ^supple- 
mented crude lysates, as is often the case, this technique has 
four distinct advantages. First, like P™ tc «^»« n '^. h ™" 1 ^- 
raphy, it detects the interactions in the midst of all .the com- 
peting proteins present in a crude lysate; therefore, the results 
from this sort of experiment have a built-in specificity control 
Second, both the antigen and the interacting protc.ns are 
present in the same relative concentrations as found in the ce 
therefore, any artificial effects of deliberate overproduction of 
he test protein are avoided. Third, elaborate complexes are 
already in their natural state and can be readily coprecipitated 
such complexes might otherwise be difficult to assemble , in 
vitro. Fourth, the proteins are present ,n their natural state of 
posttranslational modification; therefore interactions that re- 
quire phosphorylation (or lack of phosphorylation) are more 
realistically assessed. Two disadvantages are also apparent. 
First, coimmunoprecipitating proteins do not necessarily inter- 
act directly, since they can be part of larger complexes. l or 
example, the coprecipitation of E1A and p60 (now known to 
be cyclin A) (84) occurs indirectly; El A interacts with pK>7 
(237) and P 107 interacts with cyclin A (61, 62) Similarly, 
coprecipitation of Rb protein with E2F probably occurs 
through another protein (92, 179). Second, coprecip, at. on is 
not as sensitive as other methods, such as protein affinity chro- 
matography, because the concentration of the antigen is lower 
Than it kin protein affinity chromatography. This can be over- 
come by deliberately adding an excess of the antigen o h 
crude lysates to drive complex formation, as was done to detect 
a 46-kDa protein that competed with simian virus 40 T antigen 
for Rb protein (100). It can also be overcome by covalently 
cross-linking the proteins prior to immunoprec.pitation (48) 
see the section on cross-linking, below). These alterations of 
course perturb the natural conditions that make immunoprc- 
cipitation an attractive method. 

Cross-Linking 

Cross-linking is used in two ways to deduce protein-protein 
interactions. First, it is used to deduce the architecture of 
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FIG 2 Two-dimensional gels to identify cross-linked proteins in a complex^ 
Proteins that are not cross-linked have the same mobility in both dime ns.ons of 
the SDS gel and form a diagonal. Proteins that are cross-linked m.grate slow y in 
he firs, dimension; after cleavage of the cross-link with mercap.oethanol (2- 
MSH). these proteins migrate at their native posmons in the second d.mens.on 
and arc oft Ihe diagonal. 



proteins or assemblies that are readily isolated intact from the 
rell Second? it is used to detect proteins that interact with a 
given test protein ligand by probing extracts, whole cells, or 
partially purified preparations. ~ ■ 

Determination of architecture. The classical method of iden- 
tifying interacting partners in a purified protein complex in- 
volve! analysis by two-dimensional gels (Fig. 2). The procedure 
involves three steps. First, the complex is reacted with a cleav- 
able Afunctional reagent of the form RSSR', and the R and R 
groups react with susceptible amino acid side chains in the 
protein complex. This reaction forms adducts of the form P- 
RSSR'-P' Second, the proteins are fractionated on an bDb-gei 
in the absence of reducing agents. The gel separates the pro- 
teins based on molecular weight, and cross-linked proteins of 
the form P-RSSR'-P' migrate as species of greater molecular 
weight Third, a second dimension of the SDS-gel is run after 
treatment of the gel with a reducing agent to cleave the central 
S— S bond Un-cross-linked species align along the diagonal, 
because their molecular weights do not change after reduction. 
Cross-linked proteins migrate off the diagonal because they 
migrated as P-RSSR'-P' in the first dimension and as mole- 
cules of the form P-RSH and P'-R'SH in the second dimen- 
sion The cross-links are identified by their size, which corre- 
sponds to that of the un-cross-linked species P and P . this 
method has been discussed at a practical step-by-step level by 

Traut et al. (215). , . . f 

Cross-linking has been used to study the architecture of 
multienzyme complexes such as CF.-ATPase (7) and E. col, 
F -ATPase (21). It has also been used to study the structure ot 
much more complicated structures like the ribosome (41, 215). 
Since these structures are complex, the corresponding cross- 
linking pattern is necessarily complex. Furthermore, as might 
be expected different patterns are sometimes obtained as the 
reactive group is changed and as the distance between the 
react.ve groups is altered (41, 215). Several approaches have 
been taken to simplify the cross-linking patterns resulting from 
these experiments. In one approach, the proteins are prefrac- 
tionated on urea-acrylamide gels or on CM-Sepharose before 
diagonal electrophoresis (41, 217). A second approach involves 
runninc two-dimensional gels without cleaving the cross-link 
followed bv clution of individual species, cleavage of the cross 
link and resolution of the resulting proteins on a third gel (2.1) 
A third approach involves the use of antibody to identify cross 
linked partners after the use of appropriate gels (18U, LIl) 
Transfer of the gels followed by immunoblotting allows one tt 
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unequivocally identify each member of a cross-linked pair. 
Since this method is so powerful, onc-dimens.onal gels often 
suffice and noncleavable cross-linking reagents are easily used 
Since immunoblotting is also very sensitive, one can take care- 
to limit cross-linking to acceptably low levels. 

Detection of interacting proteins, (i) Detection in v.vu. 
Cross-linking in vivo can be accomplished with membrane- 
permeable cross-linking reagents followed by.unraunoprectpH 
iation of the ligand protein. This method was used to detect a 
60-kDa protein that interacts with Ras (48). Immunoprecipi- 
tation of this protein required both immune sera and cross- 
linking and was inhibited when excess Ras was added before 
immunoprecipitation. Since the cross-linked protein could be 
released from the immune complex by cleavage of the cross- 
link with dithiothreitol (but not by incubation of the immune 
complex in buffer), it was truly cross-linked. Since pretreat- 
ment of the cross-linking reagent with excess amino groups 
inhibited cross-linking but excess amino groups did not inhibit 
r-ro«-linking if cells were lysed in their presence, cross-linking 
must have occurred in vivo. The complex was reproduc.bly 
increased after mitogenic stimulation and could be detected in 
cells producing normal amounts of Ras. This experiment 

makes another point: at least in the:*, exp^...—..^, c.„.~ - 

ing before immunoprecipitation is a more sensitive technique 
than immunoprecipitation alone. 

(ii) Detection in vitro. The addition of an isolated protein or 
a peptide to a complex system offers a huge potential for 
precise and powerful cross-linking methods. Several different 
such methods have been used to detect interacting proteins. 

(a) Labeled peptide or protein. Detection of cross-linking 
partners is incomparably cleaner if the protein or peptide is 
labeled before cross-linking, because there is only one source 
of labeled material. For example, '"l-labeled gamma inter- 
feron was used to detect receptors that were cross-linked (192), 
and in vivo labeled interleukin-5 was purified before cross- 
linking to detect interacting receptors (147). 

Proteins are also routinely labeled in vitro with [ S]methi- 
onine during translation, and this was followed by cross-linking 
and by immunoprecipitation to detect protein interactions. 
This has been done, for example, to detect interaction of pre- 
prolactin and pre-p-lactamase with signal' sequence receptor 
and translocation chain-associating protein during glycosyla- 
tion (79) and to detect mitochondrial import proteins in con- 
tact with translocation intermediates (195, 204). 

(b) Photoaffinity cross-linking with labeled cross-Unking re- 
agent. A particularly useful reagent is the Denny-Jaffee re- 
agent a cleavable heterobifunctional photoactivatable cross- 
linking reagent that is labeled on the photoactivated moiety 
(49) This reagent can be coupled to an isolated protein, which 
is then incubated in an appropriate extract and photoactivated 
to cross-link nearby proteins. Since the label is on the photo- 
activatable moiety of the cross-linking reagent, it is transferred 
to the cross-linked protein after cleavage of the cross-linking 
reagent (Fig. 3). This cross-linking reagent has been used to 
identify a specific 56-kDa ZP3-binding protein on acrosome- 
intact mouse sperm (19). As much as 90% of the label initially 
on ZP3 could be transferred to the 56-kDa protein, and cross- 
linking was inhibited by excess unlabeled ZP3 protein. More- 
over ZP3 affinity columns retained a protein with the same 
molecular mass. This reagent has also been used to demon- 
strate that phospholamban interacts with a specific site on the 
ATPase from sarcoplasmic reticulum onlv when it is n on P» os - 
phorylated and the ATPase is in the Ca* -free sut e ( 106 . 

Another useful reagent of this type is '"l-{S-[N-(3-,odo-4- 
azidosalicyl)cysteaminyl]-2-thiopyridine>, also called IAC a 
cysteine-specific modifying reagent. This reagent was used to 
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demonstrate that the carboxy-terminal region of the subumi of 
E coli RNA polymerase was adjacent to the activating domain 
of the catabolite activator protein (CAP) (33). To do this, a 
unique cysteine was introduced onto the surface of CAP, in a 
residue which toleFates a large number of mutations, and a 
preexisting surface cysteine was changed to serine. Subsequent 
reaction with labeled IAC resulted in quantitative incorpora- 
tion of label and in protein with 70% of its transcription acti- 
vation activity. Irradiation of the ternary complex of DNA 
CAP and RNA polymerase yielded 20% cross-linking, all of 
which was with a particular domain of the subunit of poly- 
merase. . . , 

(c) Direct incorporation of photoreactive lysine derivative dur- 
ing translation. A photoactivatable group can be incorporated 
directly into the translation product by using a modified lysyl- 
tRNA If translation is done in the presence of [ Sjmetni- 
onine the protein is simultaneously labeled and ready for 
photoactivated cross-linking. This approach has been particu- 
larly valuable in investigating the process by which proteins are 
inserted into the endoplasmic reticulum. During elongation 
signal recognition particle (SRP) binds the nascent chain and 
halts translation until the arrested translation product is 
brought to the SRP receptor. This releases SRP, allowing 
translation to continue, coupled with translocation of the pro- 
tein into the endoplasmic reticulum. With bovine preprolactin, 
there are two lysines at positions 4 and 9 of the signal sequence 
and no other lysine residues within the first 70 ammo acids, 
after which translation is normally stopped by SRP. lHus, 
incorporation of lysine with a ph6toactivated group specifically 
probes interaction of the signal sequence with other interacting 
proteins In this way, the nascent chain was specifically cross- 
linked with the 54-kDa protein of SRP and a 35-kDa micro- 
somal membrane protein, called the signal sequence receptor 
(239) Subsequent experiments in the same system relied on 
translation of truncated mRNAs bearing lysine codons at dif- 
ferent positions. These templates produce proteins that remain 
tethered to the ribosomc through peptidyl-tRNA because of 
the lack of a termination codon. They therefore cannot com- 
plete translocation and are trapped, presumably as intermedi- 
ates In this way, it was shown that lysines in different positions 
also recognized the same 35-kDa membrane protein (121, 
Moreover, this protein is probably required for translo- 
cation because antibodies against it inhibit translocation in 

vitro (87). . . 

Investigation with the same system in S. cerevisiae demon- 
strated that prepro-a-factor is in contact with Sec61 protein 
(155) Antibodv against either Sec61 or prepro-a-factor pre- 
cipitated the same labeled cross-linked protein. Cross-linking 
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was observed only when prepn»-«-factor was tethered; release 
of the protein with puromycin or a complete translation se- 
quence abolished cross-linking. Moreover, the tethered pre- 
pro-a-factor was glycosylated while it was tethered, and cross- 
linking was ATP dependent for large tethered prepro-a-factor 
peptides; this indicated that prepro-a-factor had cn.cred u 
normal glycosylation pathway. Sanders et al. (191) also dem- 
onstrated by conventional cross-linking followed by^mmuno- 
precipitation that SecM is in contact with tethered proteins 
being translocated (in this case by talent couplmg to av din) 
the same experiments also demonstrated that BiP (Kar2) was 
cross-linked to the translocation intermediates and that secbl 
and sec63 mutants modulate the process. The convergence of 
genetics and biochemical cross-linking studies further substan- 
tiates these interactions. 

(d) Site-specific incorporation of photoreactive amino acid de- 
rivative during translation. Use of a suppressor tRNA to incor- 
porate a photoactivatable amino acid derivative results in site- 
specific incorporation by use of a gene carrying a s.ngle stop 
codon Two such reports have been described. High et al. (9-,; 
used a charged amber suppressor tRNA to insert a phenylal- 
anine derivative into various regions of the signal sequence of 
preproiactin. Cross-linking experiments demonstrated that the 
ammo-terminal end of the signal sequence is in prox.m.ty to 
the translocating chain-associating protein, whereas the hydro- 
phobic core of the sequence contacts Sec61 protein Cornish et 
al (39) used a similar method to incorporate a different pho- 
toaffinity label. Still to be described is a similar method involv- 
ing a labeled photoactivated amino acid replacement— the ul- 
timate magic bullet. 

(iii) Other considerations. One major disadvantage of using 
any cross-linking technique to detect protein-protein interac- 
tions is that it detects nearest neighbors which may not be in 
direct contact. The cross-linking reagent reaches out to any 
protein in close vicinity; thus, it may appear to detect pro- 
tein interactions that are more like ships just passing in he 
night. This is more and more of a problem as the size of the 
cross-linking reagent increases. Any interaction detected by 
cross-linking should therefore be independently assessed for 
protein-protein interactions. However, cross-linking has three 
important advantages over other methods. First it can ce- 
ment" weak interactions that would otherwise not be visible by 
other methods (see, for example, reference 48). Second, it can 
be used to detect transient contacts with d.fferent proteins at 
various stages in a dynamic process such as glycosylation, by 
freezing the process at different stages. Third, cross- inking can 
be done in vivo with membrane-permeable cross-linking re- 
agents (48). It may also be possible to detect cross-linking in 
vivo after microinjection of a protein that is modified with a 
photoactivatable cross-linking group. To our knowledge, tins 
has not yet been reported. 

LIBRARY-BASED METHODS 

A variety of methods have been developed to screen large 
libraries for genes or fragments of genes whose products may 
interact with a protein of interest. As these methods are by 
their nature highly qualitative, the interactions identified must 
be subsequently confirmed by biochemical approaches^ How- 
ever, the enormous advantage of these strategies is that the 
genes for these newly identified proteins or peptides are im- 
mediately available. This is in sharp contrast to the biochemical 
methods described in the section on physical methods to select 
and detect proteins that bind another protein, above, which 
generally result in the appearance of bands on a polyacryl- 
amide gel. These library methods also differ from classical 
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genetic techniques described in the section on genetic meth- 
ods below, which often require a specific phenotype before 
they can be carried out. Library screens are generally per- 
formed in bacteria or yeasts, organisms with rapid doubling 
times. Thus, these procedures can be completed rapidly. 

Protein Probing 

A labeled protein can be used as a probe to screen an 
expression library in order to identify genes encoding proteins 
that interact with this probe. Interactions occur on nitrocellu- 
lose filters between an immobilized protein (generally ex- 
pressed in E. coli from a Xgtll cDNA library) and the labeled 
probe protein (Fig. 4). The method is highly general and there- 
fore widely applicable, in that proteins as diverse as transcrip- 
tion factors and growth factor receptors have been used as 
probe. A variety of approaches can be used to label the protein 
ligand, or this ligand can be unlabeled and subsequently de- 
tected by specific antibody. 

The method is based on the approach of Young and Davis 
(244) who showed that an antibody can be used to screen 
expression libraries to identify a gene encoding a protein an- 
tigen The Xgtll libraries typically use an isopropyl-|J-D-tnio- 
galactopyranoside (IPTG)-inducible promoter to express pro- 
teins fused to R-galactosidase. Proteins from the bacteriophage 
plaques are transferred to nitrocellulose filters, incubated with 
antibody, and washed to remove nonspecifically bound anti- 
body Protein ligands were first used as probes in this type of 
experiment by Sikela and Hahn (200), who identified a brain 
calmodulin-binding protein with ^I-labeled calmodu in as the 
probe The Xgtl 1 -expressed fusion protein bound calmodulin 
with a K. between 3 and 10 nM, and binding was dependent on 
the presence of Ca 2 + . The signal-to-noise ratio in these exper- 
iments was higher than that found with various antibody 

Pr MacGregor et al. (135) used the leucine zipper and DNA- 
binding domain of Jun as a probe and identified the rat cAMP 
response element-binding protein type 1. In this case, the Jun 
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domain was biotinylaled and detected with a s. re pt a v.d.n -al- 
kaline phosphatase conjugate. Buffer conditions «'» J Ik ad- 
justed to distinguish a Jun-Jun homodimer from the more 
stable Fos-Jun hctcrodimcr. Ulackwood and bisenman (17 
used a similar approach with the basic-reg.on hcUx-loop hehx 
leucine zipper domain (bHLH-z.p) of the c -Myc pro cm 
92-residue carboxy terminus of Myc, containing this dom-.n, 
was expressed as a GST fusion protein punned by g^ath.one- 
agarose affinity chromatography and '*I labeled. This probe 
identified a new bHLH-zip protein termed Max and ^ge shift 
experiments indicated that the Myc-Max complex ■ exh.bucd 
site-specific DNA binding under conditions where ne ther Myc 
nor Max alone could bind. These results were critical m estab- 
lishing a long-sought role for the Myc protein. Extending this 
result" Ayerlt al. (6) used Max as a labeled probe to ident.fy 
another member of this class, termed Mad. 

A major advantage of the protein-probing approach is that 
the protein probe can be manipulated in vitro to provide, for 
example, a specific posttranslational modification or a metal 
cofaclor.' This' modification or cofactor may be essentia, Jo, the 
ability of the probe to bind to other proteins. This feature ot 
the approach was exploited in the Ca r * -dependent binding of 
calmodulin (200). Skoinik et al. (201) -tended his use 
phosphorylated probes in order to find proteins that bind to 
fhe carboxy-terminal phosphorylated tail of the epidermal 
growth factor (EOF) receptor. This tail is par. of the intracel- 
lular domain of the receptor, which P°^^_ a P™^'" ^ t 
rosine kinase activity stimulated by binding of EOF Skolmk ^c 
al purified this domain from cells infected with a recotnb.nan 
baculovirus, tyrosine phosphorylated it in vitro, and c eaved it 
to separate the phosphorylated carboxy-terminal tailtom the 
protein kinase domain. Probing an expression library identified 
proteins containing the SH2 domain, which recognizes phos- 
photyrosyl-containing peptides. Th.s cloning approach might 
be applied to the identification of proteins interacting with 
other activated phosphorylated receptors including tyrosine- 
and serine-specific phosphatases as well as kinases. In addition 
it should be possible to modify probe proteins by means other 
than phosphorylation to identify new proteins that recognize 
such modifications. . 

Probing expression libraries with labeled protein has numer- 
ous advantages. Since any protein or protein domain can be 
specifically labeled for use as a probe, the sophisticated arsenal 
of GST fusion vectors, other expression and tagging systems, 
and in vitro translation systems can be exploited; this makes 
preparation of the probe relatively straightforward. If specific 
antibody to the target protein is available, the probe protein 
need not be labeled; the antibody can be used in a second step 
to detect plaques that have bound the target protein. More 
than 10 ft plaques can be screened in an experiment, plating 5 x 
10* plaques per 150-mm dish. The method not only results in 
the immediate availability of the cloned gene for the interact- 
ing protein but also can provide data regarding a specific do- 
main involved in the interaction, because the Xgtl Im* ;rt is 
often only a partial cDNA. Conditions of the wash cycles can 
be adjusted to vary the affinity required to yield a signal. As 
with many library-based methods, probing ex press.on library 
compares equally all binary combinations of the probe s protein 
and a library-encoded protein. Thus, less abundant proteins, 
proteins with weak binding constants, and P^te.ns that ^ tem- 
porally or spatially rarely interact with the probe protein in 
vivo can all be detected as long as their transcripts are present 

ova. , i in npn/-rafp the hbrarv. 

This method has certain intrinsic limitations. Proteins en- 
coded by the library must be capable of folding correctly in £ 
col. generally as usion proteins, and of maintaining their 



simcUne on a nitrocellulose filter. However, proteins of ten can 
he rena.ured by subjecting the fillers to a denaturation-rena- 
luration cycle with 6 M guanidine hydrochloride as described 
bv Vinson et al. (222). Binding conditions are arbitrarily im- 
posed by the investigator, rather than reflecting the native 
environment of the cell. Since all combinations of protein 
protein interactions are assayed, including those that might 
never occur in vivo, the possibility of identifying artifactual 
partners exists. In particular, the relative abundance of each 
potential partner expressed in a colony or plaque of the library 
is similar, instead of varying and potentially being compart- 
mentalized as in the cell. Any posttranslational modifications 
necessary for efficient binding will generally not occur in bac- 
teria (although some such modifications can be performed in 
vitro) Screening rather than direct selection is the means of 
detection which inherently limits the number of plaques that 
can be assayed. The use of screening also restricts the further 
genetic manipulations that can be applied to the cDNA inserts. 
For example, in the analysis of point mutations, it is not pos- 
sible to select directly for rare mutations that affect the inter- 
action Different protein probes are likely to behave variably in 
this approach, such that binding and washing conditions may 
have to be adjusted, in each case to maximize the signal-to- 
noise ratio. 

Phage Display 

Basic approach. Smith (203) first demonstrated that an E. 
coi, filamentous phage can express a fusion protein bearing a 
foreign peptide on its surface. These foreign amino acids were 
accessible to antibody, such that the "fusion phage could be 
enriched over ordinary phage by immunoaffinity purification. 
Smith suggested that libraries of fusion phage might be con- 
structed and screened to identify proteins that bind to a spe- 
cific antibody. In the past few years, there have been numerous 
developments in this technology to make it applicable to a 
variety of protein-protein and protein-peptide interactions. 

Filamentous phages such as Ml 3, fd, and fl have approxi- 
mately five copies of the gene III coat protein on their surface; 
thus a foreign DNA sequence inserted into this gene results m 
multiple copies of the fusion protein displayed by the phage. 
This is called polyvalent display. Similarly, the major coat pro- 
tein encoded by gene VIII can also display a foreign insert 
n04) The gene VIII protein allows up to 2,700 copies of the 
insert per phage. Generally, polyvalent display is limited to 
smaU p P eptides g (see the next section) because larger inserts 
interfere with the function of the coat proteins and the phage 
become poorly infective. 

Random sequences can be inserted into gene III or gene 
VIII to generate a library of fusion phage (Fig. 5). Such a 
library can then be screened to identify specific phage that 
display any sequence for which there is a binding partner, such 
as an antibody. This screening is performed by a series of 
affinity purifications known as panning. The phage are bound 
to the antibody, which is immobilized on a plastic dish. Phage 
that do not bind are washed away, and bound phage are eluted 
and used to infect E. coli. Each cycle results in a 1,000-fold or 
greater enrichment of specific phage, such that after a few 
rounds DNA sequencing of the tight-binding phage reveals 
only a small number of sequences. In addition to the advantage 
of high selectivity, a second advantage of this technology is that 
laree" phage libraries can be constructed (up to 10 to H) 
complexity) and the affinity purification step can be carried out 

at very hit»h concentrations ot pnage t ^yc , 

(50) Third the direct coupling of the fusion protein to its gene 
in a single phage allows the immediate availability of sequence 
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FIG 5 A peptide library in a filamentous phage vector. The figure illustrates 
the process of panning, by which peptides that bind to an adsorbent are identi- 
fied. 



data to generate one or more consensus sequences of bound 
peptides or the sequences of variant proteins with a specific 
phenotype. Fourth, the phage can be used directly to assess the 
binding specificity of the encoded fusion proteins by varying 
the stringency of the wash procedures used in the panning 
cycles. 

Random-sequence peptide libraries have been generated by 
cloning synthetic oligonucleotides into gene III (Fig- 5). Scott 
and Smith (198) generated a hexapeptide library and screened 
it to identify epitopes for two monoclonal antibodies specific 
for a hexapeptide from the protein myohemerythrin. Cwirla et 
al. (44) constructed a similar hexapeptide library to find pep- 
tides that can bind to a monoclonal antibody specific for a 
tetrapeptide from p-endorphin. Such epitope libraries allow 
rapid characterization of an unknown epitope recognized by 
either a monoclonal antibody or polyclonal serum. For exam- 
ple, monoclonal antibody pAB240, which recognizes the mu- 
tant conformation of the tumor suppressor p53 protein was 
shown to bind to a 5-amino-acid motif in p53 (210). The bind- 
ing partner for the phage-encoded peptides need not be an 
antibody. For example, Devlin et al. (50) constructed a 15- 
residue peptide library and used it to identify nine different 
peptides that bind to streptavidin. 

A major advance in phage display came with the develop- 
ment of a monovalent system in which the coat protein fusion 
is expressed from a phagemid and a helper phage supplies a 
large excess of the wild-type coat protein (11, 131). Therefore, 
the phage are functional because the recombinant protein 
makes up only a small amount of the total coat protein. The 
vast majority (>99%) of the population of phage particles 
display either one or no copies of the fusion protein on their 
surface. Such phage can accommodate 50 kDa of foreign pro- 
tein without any significant effect on phage infectivity. In ad- 
dition, monovalent phage display avoids potential avidity ef- 
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fects observed with polyvalent display, in which the phage can 
attach to the adsorbent at multiple points. 

Phage display has also been used to identify proteins with 
increased binding affinity. In some cases, the use of monova- 
lent display was necessary to avoid potential avidity effects, 
attributed to multipoint attachment of the polyvalent phage to 
the absorbent (231). Lowman et al. (131) expressed nearly one 
million mutants of human growth hormone (191 residues) as 
fusion phage and identified variants that bound tightly to the 
growth hormone receptor. The mutations were directed to 12 
sites known to be important for binding to the receptor. Some 
variants had binding affinities up to eightfold greater than that 
of the wild-type hormone. Roberts et al. (186) used polyvalent 
display of bovine pancreatic trypsin inhibitor and directed mu- 
tagenesis to five residues of the protein. They selected for 
high-affinity inhibitors of human neutrophil elastase and iden- 
tified one variant with an affinity 3.6 X 10 6 higher than that of 
wild-type bovine pancreatic trypsin inhibitor. 

A similar strategy can be used with nontargeted mutagene- 
sis. For example, Pannekoek et al. (167) expressed human 
plasminogen activator inhibitor 1, a 42-kDa protein, as a gene 
III protein fusion under conditions for monovalent display. 
The phage-dlsplaycd inhibitor could specifically form com- 
plexes with serine protease tissue-type plasminogen activator. 
PCR mutagenesis was used to generate a library of mutant 
plasminogen activator inhibitor 1 proteins, which can be 
screened to analyze structure-function relationships. 

Phage display presents several advantages for the study of 
protein-protein interactions. The very large sizes of either ran- 
dom libraries or pools of individual variants of a single se- 
quence that can be generated mean that complex mixtures can 
be screened. While not strictly a genetic approach, in that there 
is no direct selection for an interacting partner, phage display 
has many of the properties of genetic selection through its use 
of panning cycles. It is a rapid procedure and should be widely 
applicable. Although screening a random library of cDNA by a 
panning procedure to identify proteins that interact with a 
protein of interest has not yet been demonstrated, this strategy 
should prove workable. 

Disadvantages of phage display include the size limitation of 
protein sequence for polyvalent display; the requirement for 
proteins to be secreted from E. coli\ and the use of a bacterial 
host which may preclude the correct folding or modification of 
some proteins. All phage-encoded proteins are fusion proteins, 
which may limit the activity or accessibility for binding of some 
proteins. Since binding is'detected in vitro, the same consid- 
erations of an in vitro approach that are relevant for protein 
probing of expression libraries are relevant here. 

Related methods, (i) Antibody phage. While we do not spe- 
cifically address the vast topic of antigen-antibody interactions 
in this review, it is worth noting that phage display can be 
applied to these interactions. The principle of displaying anti- 
body-combining domains on the surface of phage was first 
demonstrated by McCafferty et al. (141). The heavy- and light- 
chain variable domains of an anti-lysozyme antibody were 
linked on the same polypeptide and expressed as a gene III 
protein fusion. Over 1,000-fold enrichment of the antibody 
could be obtained by a single passage over a lysozyme-Sepha- 
rose column. This method was then extended by this and other 
groups to allow the display of libraries of combining domains, 
such that new antibodies or mutant versions of existing anti- 
bodies could be generated. 

Kang et al. (110) used a vector to express a combinatorial 
library of functional Fab molecules (~50-kDa heterodimer) on 
the surface of a phage. The Fd chain, consisting of the variable 
region and constant domain 1 of the immunoglobulin heavy 
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chain, was synthesized as a gene VIII protein fusion, while the 
k light chain contained no phage sequence. 1 he two chains 
could assemble in the bacterial periplasm and become incor- 
porated into the phage on comfeclion with helper. I hagc co- 
Lined 1 to 24 antigen binding sites per particle. The vector 
system described allows recombination of the two chains to 
generate large combinatorial libraries. A similar strategy to 
. express Fabs by using the gene III proteurha? also been de- 
scribed : (1G). Additionally, a combinatorial library oi linked 
heavy- and light-chain variable genes fused to the gene III 
protein has been shown to be capable of detecting a high- 
affinity binder (37). Kang et al. (110) suggested that such sys- 
tems can be used for mutation and selection cycles to generate 
high-affinity antibodies. Moreover, they envisioned that the 
syftems can be extended to analyze any protein recognition 
system, such as ligand-receptor interactions. 

Phage displav of Fab fragments was extended by Burton et 
al (26), who generated a library of such fragments trorr .the 
RNA of a human immunodeficiency virus-positive individual. 
After four rounds of panning with immobilized surface g yco- 
protein g P 120 of the virus as the adsorbent, specific viral a n- 
iihodiw were obtained. A similar method was used to obtain 
human antibody Fabs that recognize the hepatitis B surtace 

an (lf) e peptidcs on plasmids. In a method highly analogous to 
phage display, random peptides are fused to the C terminus of 
the £ coli Lac repressor and expressed from a plasm.d that 
also contains Lac repressor-binding sites (43). Thus the pep- 
tide fusions bind to the same plasmid that encodes them. The 
bacterial cells are lysed, and the peptide libraries are screened 
for peptides that bind to an immobilized receptor by using 
similar panning cycles to those for phage libraries. In this case, 
peptides become enriched because bound peptides carry their 
encoding plasmids with them, via the repressor-opcrator inter- 
action, and these plasmids can be transformed back into £ 
coli. In the initial example, peptides that bind to a monoclonal 
antibody specific for dynorphin B were selected, and these 
peptides contained a hexapeptide sequence similar to a seg- 
ment of dynorphin B (43). This method is distinguished from 
the phage display methods in that the peptides are exposed at 
the C terminus of the fusion protein and the fusions are cyto- 
plasmic rather than exported to the periplasm. 

Two-Hybrid System 

The two-hybrid system (35, 65, 66) is a genetic method that 
uses transcriptional activity as a measure of protein-protein 
interaction. It relies on the modular nature of many site-spe- 
cific transcriptional activators, which consist of a DNA-bmd.ng 
domain and a transcriptional activation domain (23, V/, ill). 
The DNA-binding domain serves to target the activator to the 
specific genes that will be expressed, and the activation domain 
contacts other proteins of the transcriptional machinery to 
enable transcription to occur. The two-hybrid system is based 
on the observation that the two domains of the activator need 
not be covalently linked and can be brought together by the 
interaction of any two proteins. The application of this system 
requires that two hybrids be constructed: a DNA-b.nding do- 
main fused to some protein, X, and a transcription activation 
domain fused to some protein. Y. These two hybrid ; are ex- 
pressed in a cell containing one or mo^er^rter genes. If the 
X and Y proteins interact, they create a functional activator by 
bringing the activation domain into close proximity wuh J.v 
DNA-binding domain; this can be detected by expression of 
the reporter genes (Fig. 6). While the assa y has been genera lly 
performed in yeast cells, it works similarly in mammalian cells 
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(see, e.g., reference 46) and should be applicable to any other 
eukaryotic cells. . 

This method has been used with a wide variety of proteins, 
including some that normally reside in the nucleus, cytoplasm, 
or mitochondria, are peripherally associated with membranes, 
or arc extracellular (see reference 66 for a review). It can be 
u^ed to detect interactions between candidate proteins whose 
eenes are available by constructing the appropriate hybrids and 
festing for reporter gene activity (220, 249). If an interaction is 
detected, deletions can be made in the DNA encoding one of 
the interacting proteins to identify a minimal domain for in- 
teraction (35). In addition, point mutations can be assayed to 
identify specific amino acid residues critical for the interaction 
(127). Most significantly, the two-hybrid system can be used to 
screen libraries of activation domain hybrids to identify pro- 
teins that bind to a protein of interest. These greens result in 
the immediate availability of the cloned gene for any new 
protein identified. In addition, since multiple clones that en- 
code overlapping regions of protein are often identified the 
minimal domain for interaction may be readily apparent from 
the initial screen (105, 223). . 

A variety of versions of the two-hybrid system exist, com- 
monly involving DNA-binding domains that derive from the 
yeast Gal4 protein (35, 55) or the E. coh LxxA protein (223, 
247) Transcriptional activation domains are commonly de- 
rived from the Gal4 protein (35, 55) or the herpes simplex virus 
VP16 protein (45). Reporter genes include the E coh tact 
eene (65) and selectable yeast genes such as HIS3 (55) and 
LEU2 (247) An increasing number of activation domain li- 
braries are becoming available, such that screens are now fea- 
sible for proteins from many different organisms or specific 
mammalian tissues. 

One field in which the two-hybrid system has been applied 
with considerable success has been the study of oncogenes and 
tumor suppressors and the related area of cell cycle control. 
For example, reconstruction experiments with previously 
cloned proteins indicated that interactions occur between Ras 
and the protein kinase Raf (220, 249), human Sosl guanine 
nucleotide exchanger and the growth factor receptor-associ- 
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ated protein Grb2 (30), and Rat and the transcription factor 
inhibitor IkB (129). Two-hybrid searches with oncoproteins or 
tumor suppressors as targets have identified a leucine zipper 
protein that binds to Jun (34); protein phosphatase II ^ 
which binds to Rb (55); a bHLH-zip protein Mxil . which binds 
to the Myc-associated protein Max (247); and the Rb-rclated 
protein pl30, which binds to cyclins and was identified through 
its interaction with the cyciin-dependeat kinase Cdk2 (83). A 
notable convergence of different approaches came about with 
the identification of another protein that binds to Cdk2, a 
21-kDa protein termed Cipl, which inhibits the kinase activity 
(85) This protein turned out to be identical to a protein en- 
coded by the major p53-inducible transcript (58), suggesting 
that the tumor suppressor role of p53 may be mediated by its 
activation of the gene for this 21-kDa protein. 

The two-hybrid system has several features that make it 
useful for analysis of protein-protein interactions. It is highly 
sensitive, detecting interactions that are not detected by other 
. , , a ., -^f^r^^^p,- m . iri A ??fn On the basis of 
methods (see, e.g., reterun^es anc 

binding of different proteins to the retinoblastoma protein, 
Durfee et al. (56) estimate that the minimal binding constant 
ren.jired to detect an interaction in their version of the two- 
hybrid system is on the order of 1 m-M This value suggests that 
the system should be applicable to a wide range of protein 
interactions. However, it is clear that the minimal affinity in- 
teraction detectable wilt depend on such variables as the level 
of expression of the hybrid proteins; the number, sequence, 
and arrangement of the DNA-binding sites in the reporter 
gene(s)- and the amount of reporter protein required for a 
detectable phenotype. Given these variables, it is likely that 
some versions of the system may detect weak interactions with 
binding constants considerably greater than 1 p.M. Another 
advantage is that the interactions are detected within the na- 
tive environment of the cell and hence that no biochemica. 
purification is required. The use of genetic-based organisms 
like yeast cells as the hosts for studying interactions allows both 
a direct selection for interacting proteins and the screening ot 
a large number of variants to detect those that might interact 
either more or less strongly. With a reporter gene such as the 
yeast HIS3 gene, the competitive inhibitor 3-aminotnazole can 
be used to directly select for constructs which yield increased 

affinity. . , 

The two-hybrid system is limited to proteins that can be 
localized to the nucleus, which may prevent its use with certain 
extracellular proteins. Proteins must be able to fold and exist 
stably in yeast cells and to retain activity as fusion proteins. 1 he 
use of protein fusions also means that the site of interaction 
may be occluded bv one of the transcription factor domains. 
Interactions dependent on a posttranslational modification 
that does not occur in yeast cells will not be detected. Many 
proteins, including those not normally involved in transcrip- 
tion, will activate transcription when fused to a DNA-binding 
domain (134), and this activation prevents a library screen 
from being performed. However, it is often possible to delete 
a small region of a protein that activates transcription and 
hence to remove the activation function while retaining other 
properties of the protein. 

Other Library-Based Methods 

A number of other library strategies have been developed 
recently that, in principle, should result in the identification ot 
proteins that interact with a protein of interest. However, be- 
cause the first description of methods generally involves known 
combinations of proteins, the general applicability of a new 
method cannot be easily judged. 
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In one approach, the ability of the E. colt bacteriophage X 
repressor to dimenze was used as a reporter for the interaction 
of leucine zipper domains (98). The N-terminal domain of 
repressor binds to DNA but dimerizes inefficiently; a separate 
C-tcrminal domain that mediates dimerization is required for 
efficient binding of the protein to its operator. The N-tcrmmal 
DNA-binding domain was fused to the leucine zipper of the 
yeast Gcn4 protein, which allowed dimerization and repression 
of transcription in £. coli. This repression enabled the host cell 
to survive superinfection by \ phage. This phenomenon en- 
abled Hu et al. (98) to introduce single-amino-acid mutations 
into the leucine zipper domain and to use a genetic assay in E. 
coli to determine whether dimerization of the zipper domain 
occurred. They suggested that this assay could be used to select 
clones from a library for proteins that bind to a target protein, 
which is expressed in E. coli as a repressor hybrid. Any phage 
that express a protein that binds to the target protein should 
compete for dimerization of the repressor and its ability to 
bind X operators. These phage would be detected because they 
result in plaques. As described, this approach would be limited 
to target proteins that homodimerize. In addition, this method 
when applied to library screening is a competition assay; it 
would require that the library-encoded protein bind to the 
target protein in preference to the target protein interacting 
with itself. 

Another E. co/i-based assay involves tagging the target pro- 
tein with biotin by fusing it to the biotin carboxylase carrier 
protein (74) This tag allows the protein to be bound by avidin, 
streptavidin, or anti-biotin antibody-coated filters. Potential 
interacting proteins are fused to the LacZ protein and ex- 
pressed from a X vector such that p-gaiactosidase activity is 
intact These phage are infected into cells containing the bi- 
otin-tagged target protein, and interaction can occur in vivo 
between a library-encoded protein and the target protein. 1 his 
interaction is then detected when the phage plaques are trans- 
ferred to avidin filters and assayed for p-galactosidase activity. 
The method was shown to work by using biotinylated c-Jun 
protein and a c-Fos-LacZ fusion. Although the protein-pro- 
tein interaction occurs within the living bacterial cells, the 
detection of this interaction occurs in vitro on filters that must 
be washed after transfer of the proteins. Thus, in principle, this 
method may have many of the same limitations that protein 
probing of expression libraries has. 



GENETIC METHODS 

For organisms for which powerful genetic analysis methods 
exist sophisticated strategies can be designed to uncover genes 
that show interactions with other genes. In many cases, these 
newly uncovered genes encode proteins that physically interact 
with proteins encoded by the known genes. In other cases, 
cenetic methods can be used to confirm interactions among 
previously identified proteins. These strategies are generally 
based on classical genetic approaches. For example, identifi- 
cation of extragenic suppressors often reveals mutations in 
genes whose products physically interact with the protein con- 
taining the original defect. Synthetic lethal screens yield mu- 
tations that, in combination with another nonlethal mutation, 
result in the inability of the organism to grow; this phenotype 
is commonly due to alterations in interacting proteins. Over- 
production of certain proteins can lead to the suppression of 
mutations in interacting proteins. In other cases, overproduc- 
tion disrupts a cellular process by altering the balance of the 
different components of a complex structure, or the overpro- 
duced protein is nonfunctional and acts in a dominant-negative 
manner. 
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FIG. 7. Extragcnic suppression due to restorat.on of a protein-protein inter- 
action. 



The value of some of these genetic approaches has been 
significantly increased by applying them to organisms not ame- 
nable to classical genetic techniques, using modern molecular 
tools For example, the ability to generate mice either carrying 
novel genetic information or deleted for one or more of their 
endogenous genes allows this organism to be analyzeu by some 
of the logic formerly reserved for much simpler creatures. 
However, it must be kept in mind with any genetic approach 
that identification of mutants with the correct phenotypes does 
not guarantee that the biochemical mechanisms invoked to 
explain these phenotypes are correct. 

Extragcnic Suppressors 
Suppressor mutations are mutations that partially or fully 
revert the phenotype caused by an original mutation (see ref- 
erence 86 for review). Extragenic suppressors occur in genes 
other than the gene carrying the primary mutation This is 
illustrated in Fig. 7, in which a mutation of protein Y to Y 
compensates for the defect X* to restore activity to the XY 
dimcr. However, analysis of these suppressors is often difficult, 
because they lack anv phenotype in the absence of the primary 
mutation. To circumvent this problem, Jarvik and Botstein 
(107) sought suppressors of temperature-sensitive mutants ot 
phage P22 that resulted in a cold-sensitive phenotype. This 
cold-sensitive phenotype did not necessarily depend upon the 
presence of the original mutation causing temperature sensi- 
tivity and thus mutations in new genes could be uncovered. It 
was proposed (107) that one mechanism of this suppression is 
that the original mutation and the suppressor he in genes 
whose products physically interact and that the original muta- 
tion destroyed this interaction. The suppressor then produces 
a compensating alteration that restores the interaction. 

This type of suppressor analysis has been exploited in study- 
ing fundamental processes in yeast cells, particularly cell cycle 
control, cytoskeleton structure, and RNA splicing. Moir et al 
(152) isolated cold-sensitive cell division cycle (ede) mutants ot 
Saccharomyces cerevisiae and used them to identify tempera- 
ture-sensitive revertants. Some of these revertants earned new 
mutations that alone resulted in a ede phenotype at the restric- 
tive temperature, suggesting that the mutated gene products 
might interact with the cold-sensitive protein. These results 
support the idea that only a few genes might be capable ot 
mutation to generate an altered product that can suppress the 
original mutation. Thus, this approach can be applied to a 
process such as cell cycle control and reveal most or all oi tne 
interacting gene products. ^dtivp 
In a similar strategy, suppressors of a temperature-sensit ye 
mutation in the S cerevisiae actin gene that f^ re ^^^ 
sensitive phenotype identified five new genes (160) Mutations 
in these genes, even in a background with the wi Id-type act n 
gene led to phenotypes similar to those of actm mutants. 
These results suggested that these genes could encode proteins 
that are part of the actin cytoskeleton. In a related approach, 



dominant suppressors of an actin mutation also identified a 
gene whose product may interact with actm (3) in both these 
cases, the suppressor mutations showed allele specificity; some 
but not all actin alleles were suppressed by a given mutation. 
This allele specificity also supported the idea of a direct phys- 
iral interaction, in that suppressor mutations that simply by- 
pass the requirement for the protein containing the original 
mutation would not be expected to show such specificity. 

The nematode Caenorhabditis elegans has also been used 
extensively for suppression analysis because large populations 
of individuals can be examined (96). If a temperature-sensitive 
mutant is available, it can be shifted to the restrictive temper- 
ature to apply a direct selection for suppressors. This approach 
has been used to study such processes as movement, egg laying, 
and sex determination. One example is the suppression of an 
unc-22 mutation that resulted in muscle twitching (151). Some 
of these suppressors were mutations in the unc-54 gene which 
encodes the major myosin gene. These results suggested that 
the unc-22 and unc-54 proteins physically interact, and this 
idea is supported by the finding that the unc-22 protein, like 
myosin, is located in the A-bands of muscle (150). 

Suppressor analysis can clearly uncover new mutations that 
affect a process under study, and analysis of the genes and 
proteins defined by these mutations sometimes indicates inter- 
acting proteins. While often used with temperature-sensitive 
and cold-sensitive mutations, many other types of spontaneous 
mutations can also be readily suppressed if an appropriate 
genetic selection is available. With the availability of numerous 
cloned genes, conditional alleles can now be generated by in 
vitro mutagenesis methods. An obvious limitation of this type 
of analysis is that it can generally be applied only to simple 
organisms such as phages, bacteria, yeasts, nematodes, and 
Drosophila species. It requires not only the gene of interest but 
also a useful mutant to initiate the analysis. For example, 
suppressors in an interacting protein may be difficult or impos- 
sible to obtain if the original mutation does not affect a domain 
of interaction. Furthermore, other mechanisms can yield sup- 
pressors These include second intragenic mutations, gene du- 
plication of the original mutant gene, suppression by epistasis 
and informational suppression (see, for example, ^ence 
96) Thus, identification of the suppressors of interest against a 
background of these other mutations can be a time-consuming 
process. 

Synthetic Lethal Effects 

Mutations in two genes can cause death (or another observ- 
able defect) while mutation in either alone does not This 
phenomenon is called a synthetic effect and can result from 
physical interactions between two proteins required for the 
same essential function. This is illustrated in F.g^ 8, in which 
the dimer XY is required for some function and loss of this 
function results in a detectable phenotype. Mutation in X or Y 
yields partial binding, but the double mutant X*Y* has no 
binding Dobzhansky (52) first described synthetic lethal effects 
in Drosophila species. However, the search for synthetic lethal 
effects has been applied successfully most often in S. cerevisiae. 
One of the tools available for research in this organism is a 
colony-sectoring assay (93, 119), in which cells containing a 
plasmid are red and can therefore be easily distinguished from 
those that have lost the plasmid and are white. If maintenance 
of the plasmid is not essential for viability of the yeast, colonies 
appear with red-and-white sectoring. If the cells become de- 
pendent on a gene carried by the plasmid, the colonies appear 
uniformly red. For example, Bender and Pringle (15) used such 
an assav with a plasmid-borne copy of the MSB1 gene, which 



Mickobioi .. IvI V. 



10» 



rill'/K KY AND I it I D 1 ' 
A. 





or 





Wild type: full binding 

FIG. 8. Synthc 



Single mutant: partial binding 



Double mutant: no binding 



nc efrec , in w hich either s.ngle muwn. is function., but no, the doub.e muunt. 



plays a role in bud formation. They mutagenized th eptanMJ 
STe^^^ 

« id is maintained at high copy number ,n 5. <*™™ { X\t?< 
, -^ihlr to identify mutations in new genes that are lct..ai o-. 
c P a °;t VupJSS by multiple copies of the plasrmd-borne 

g cne - , „.„ .„ k -„ h„ ro.tigan et al. (40) to 

A similar approacu w« luK^.. 5 

identify mutant? that require the Spa2 protein winch u abo 
involved in polarized cell growth as well as in the £°rphoge 
n^tir rhanees that occur in yeast mating. The synthetic letnai 
cTenldeSd the SLK1 'gene, which is J^.^ ™ r g 
phogenesis in vegetatively growing yeast «J>« J; 
phefomone-treated cells. Costigan et al pointed out tha ^the 
synthetic lethal screen by the colony color assay is ex^me 
sensitive and can identify mutants w.th low ^v.abil ry. 
spa2 andWft/ mutants are tnd.v.dually healthy the screen a 
not simply combine two mutations each caus.ng ""healthiness 
to result "n death, a common concern in using ! ™*od 
Instead, it seems likely that the synthetic letha eff ect often 
results from two different defects in the same cellular process. 

Other Tnthetic lethal screens in yeast cells .nvolve a poison 
assay fn which the presence of a plasmid-borne gene on a 
partLlr medium is letha.; when yeast cells , containing ,«ns 
plasmid are placed on such a medium, there is strong , se lee tion 
For cells that have lost the plasmid. However mutants that 
cannot survive without the plasmid can be identified, because 
the plasTd also contains the gene of interest whose presence 
is reauired in these mutants. Such mutants do not grow on 
reolto 3at« "containing the poison. This approach was used to 
de P S P mutations in th! 3-hy P droxy-3-methylglutary ^coenzyme 
A reductase eenes (12). Alternatively, the gene of interest can 
te^SSWng a regulated promoter, such that mutanu 
that do not survive the repressed condition are identified^ n 
due ble expression of the Jeast RAS2 gene [^^T^ 
cation of mutations in the CYR1 gene, which ^"^deny 
late cyclase (149). Finally, synthetic lethal effects can 
'utov^d by combining mutations identified in other gene 
screens. For example yeast cells ^e = 
sensitive mutation in the SEC4 gene, ess ^ nu ^ . , 
are inviable at the permissive temperature when they aiso 
"ntain a temperature-sensitive mutation ,n certain otto SEC 
genes (190) Yeast cells with mutations in both a-tubulin ana 

^tS^^^^ often lead to the identification 
of belting gene products, other explanations do^ot require 
this physical interaction (101). For example, the tv,o protein. 
m gin both be components of the ^me structure or one pro^ 
tein could regulate the activity of the other. Additionally, there 
are likely to be some cases in which the combination of two 
mutations, either of which causes poor growth on its own, leads 
to complete inviability. 



Overproduction Phenotypes 

Overproduction of wild-type proteins. The overproduction 
of some wild-type proteins can lead to phenotypes that provide 
tosS into protein protein interactions. In S. cerevu.ae a mul- 
Ucopy plasmid often suppresses mutations in genes other than 
he one carried on the plasmid (reviewed in reference 182) 
For example, a temperature-sensitive mutation in the CDC.i, 
Lne which encodes a protein kinase involved ,n controlling 
fell division, can be suppressed by multicopy plasmids carrying 
the CLNi or CLN2 gene, which encode cychns (82). 

In other cases, overproduction of a protein can cause a 
phenotype that is altered by overproduction of an interact ng 
nrotein High-copy-number plasmids expressing either of the 
^histo^e pafrs H2A and H2B or ™™*™ w ™^™ 
inrreased frequency of chromosome loss (142). However, over 
prod^on of both'pairs of h.stone proteins die 1 no, affect he 
fidelity of chromosome transmission, indicating that it is the 
mba Knee of the two dimer sets with respect to one another 
™, a 't"~ this fidelity (142). Overproduction of the yeast Gal4 
So tein ^ the Uanscriptiona. activator of the galactose-inducible 
eenes leads to galactose-independent transcription. However 
oroner regulation is restored if the GalSO protein, a negative 
Sato ?hat binds to the Gal4 protein, is also overproduced 
M59 White the phenotype due to an overproduced wild-type 
protein may reflect interactions with another protein (cither 
mutant or wild type), there are several other mechanisms by 
whS "such phenotypes can occur. For example an overpro- 
duced nrotefn may bypass the transcriptional regulation due to 
another protein. I oTher cases, an overproduced protein ma> 
lead indirectly to the stabilization of a mutant protein. 
' Overproduction of mutant proteins. Overproduction of a 
„ ™f„nrtional version of a protein can result in a mutant pne 

abTe to bfnd DNA results in a nonfunctional repressor (143). 
This End of mutant provides evidence for the multimer.c na- 
^re of the repressor, and analysis of the sites of mutation 
defined the domains involved in DNA binding and in oligomer- 

^similar mechanism may operate in many human cancers 
Tte wSwpe P 53 protein is a transcriptional regulator whic 
I IZZX audits oligomerization domain » »«r*e 
terminus. Mutations in the central domain of P 53 that occu r i 
tumors produce dominant-negative mutant proteins that bin 
to and ^activate the function of the wild-type protein (67) 
Th abi S to manipulate cloned genes and reintroduce thes. 
mu,.m vSrsSns int'o cells now allows dominant-negative mu 
tants to be created in many different organisms. For example 
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dominant-negative Myc proteins were ^^X^/fand 
b .„.„ a „.-i r H,„.,n to inhibit transformation by the v-aM ana 
BCR-ABL oncogenes (194). It was suggested that this cttect 
was due to the mutant Myc protetns competing with the en 
dogenous wild-type Myc protein for b.nd.ng to the Max pro 
tein, thus forming nonfunctional hctcrodimers. 

Unlinked Noncomplcmentation 

individuals heterozygous for two different recessive :muta- 
tions sometimes display a mutant phenotype. This unlinked 
noncomplementation is often interpreted as being due to mu 
tation in two genes that encode intcract.ng produc _ In Dw- 
sophila spp., new recessive mutat 10 ns wcrc . '^"^•^ 
farted to complement B 2 -tubuhn mutations and that mapped to 
other genes (176). At least one of these mutations mapped very 
die fo an Ubu.in gene. A model for this ^"^men- 
tation is based on a minimal dosage requirement for the prod 
uct of two interacting proteins. If the mutant proteins assemble 
randomly with the wild type, the double heterozygote would 
contain only one-fourth the normal level of complex, which 
would be insufficient for function. In addition, when i homozy- 
gous, some of the second-site noncomplementing mutations 
lead to defects in tubulin function, and this property is consis- 
tent with the model. 

POPULAR METHODS TO ESTIMATE AND DETERMINE 
BINDING CONSTANTS 

Importance of Characterization of the Binding Interaction 

The ultimate goal of studying protein-protein i"j« ac " on ^ 
to understand the consequences of the interact on for <xU 
function. This depends in turn on understanding the stren J 
of the interaction in the cell. The determination that tuo ^pro 
teins can interact with one another is only the fir t step in 
understanding if, and to what extent, the tnterac ,on takes 
place in vivo. Evaluation of the interaction acquires the as ess 
ment of at least six parameters, which are discussed I belo« . 

Binding constant. For any simple interaction of one : protc.r. 
(P) with another (L, for ligand), the interaction is ^governed by 

hi binding constant K d , according to the simp e eq> ation^ 
[P f ][L ( ]/[PL]. in this equation. [P fl anc [LI refe mo he fr e 
(i.e.. unbound) concentrations of P and L res P^" - 
Interaction between protein and ligand is also expr ^sc«l in «o 
other wavs First it is often expressed instead as an athnity 
»S = [PL]/[P f ][L f l, i-c, K a = UK,. Second, ,t ,s often 



expressed as a ratio of two rate constants. The rate of forma- 
tion of PL is k a |l',l[L f ], where k a is the association rate con- 
stant and the rate of breakdown of PL is k., [PL], where k , is 
the dissociation rate constant. At equilibrium the rate of for- 
mation of PL equals the rate of breakdown of PL, and Kj = 
kjk a . Evaluation of the dissociation constant is the suojca o. 

this section. . ... 

Concentrations of species. To evaluate the extent to which 
two proteins can interact, the cellular (or compartmental) con- 
centrations of P, (the sum of bound and unbound concentra- 
tions) and L, are required, in addition to the dissociation con- 
stant These two parameters can drastically alter an evaluation 
of the population of molecules in a complex. For example, if K d 
_ , p i = j L i 38% of the species are in the complex PL at any 
one time. If is 10-fold higher (weaker binding) only 8 4% of 
the species are in the complex at one time, and if K a is 10-fold 
lower (stronger binding), 73% of the species are in a comply 
A similar effect holds for alterations in the concentrations of P 
and L in the cell. A simple way of calculating [PL] from the 
easily measured parameters [P ] and Ul , is as : follows: [PL, = 
{([P,] + [L,] + K*V2> - m {( [ P ' ] + [U] K " ] ~ ' ] 
^Influence of competing proteins. Even if a protein has high 
affinity for a ligand protein, L, and the protein and hgand are 
present in sufficient quantities to interact functionally in the 
cell they may not do so in vivo to the same extent as in vitro 
Other ligands may effectively compete for the ligand protein if 
thev are present at high enough concentration and interact 
with sufficient affinity. For example, if the concentrator , of P 
and LI are both equal to the dissociation constant 38% of the 
species are in a complex. If another ligand L2 (or a set of 
potential ligands), is present at 1,000 times the concentration 
of LJ and has 10-fold- lower affinity for P the interaction of P 
with L2 will titrate the vast majority of the protein P (99, 0 . i f 
I 2 was the only interacting protein), leaving very little to .in- 
teract with LI. This sort of consideration is addressed in part 
by protein affinity columns, coimmunoprecip.tat.on experi- 
ments and cross-linking, since all the protetns in the applied 
Extract have equal opportunity to bind. It is not addressed ,n 
affimS Slotting or library-based detection methods, ,n whtch 

Pro'tdn-proTe Vlt^SIS^ other macromolecu.es 
(DNA RNA and proteins) can affect protein-protein interac- 
SrTby forming ternary (or larger) complexes. Such com- 
plexes can be ve 8 ry much more stable than the corresponding 

b ' "ScTrfSSir compartmentation. A protein that is inter- 
actinc with a ligand or a set of ligands is also influenced by Us 
Hon fr 'the cell. For example, some transcription factors are 
Sated in part by their partitioning between the cytoplasm 
and nucleus; fhey can interact with the transcr.ptton machinery 
onlv when they are in the nucleus. «• , ,i,„ 

Solution conditions. Other factors that can affect the 
strength of protein-protein interactions include solution con- 
ditions (salt concentration, pH. etc.), as well as the effects of 
molecules such as polyethylene glycol, which causes macromo- 
lecular crowding and can significantly lower the observed bind- 
ing constant of proteins (see, for example, reference 10S). 

Limits of Binding-Constant Considerations 

The lower limit for the concentration of a protein in an 
oreanism of the size of the yeast S. cerevtsiac is 0.1 nM (as- 
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TABLh 1. Dissociation mnsianis for some well-defined protein-protein interactions 
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TaGTP-yS:PDE7 

TaGDF;PDE-y 

CAP cAMP:RNA polh 

T7 gene 2.5 protein:T7 DNA polymerase 

\ repressor (dimer to tetramer) 

X repressor (monomendimer) 

Citrate synthase: malate dehydrogenase 

C4 binding protein: human protein S 

p85 (PI3K): tyrosine-phosphorylated peptide from PDGF 

CheY:CheA 

CheA:CheW 

VAMP2:syntaxin A 

EGF:EGF receptor 

PKA-C:PKA-R 

PREangiogenin 

ras.raf 

NusB:S10 

NusA: core RNA polymerase 
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Reference 
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fi. an. 


234 


<\ ■ 


10" 11 


Activity 


233 
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10" 10 


int. fl. 


164 
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Kr 9 


int. fl. 
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3 ■ 


10" 5 


fl. an. 


91 


1 • 


10" 6 


fl. an. 


170 


1.1 ■ 


10 6 


fl. an. 


115 


2.3 • 


1CT 6 


fl. an. 
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2 - 


10 -8 


l.z. gf. 


13, 193 
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fl. an. 


214 


6 y 




Solid phase 


158 


5.2 >: 


10" 8 


SPR 


166 


3 ✓ 


10' 8 


SPR 


197 


E3 >' 


lO" 5 


cq. gf. 


72 


4.7 


I0" 6 


SPR 


27 


4.1 • 


lO" 7 


SPR 


249 


■*> i , 
i — i 


lO"' 0 


SPR 


88 


7 ■■ 


10 -lfi 


Fluorescence, exch 


126 


5 ■ 


10" H 


GST ppt'n 


227 


i . 


10" 7 * 


Jutiuov ^iduivui jvu it 


13S 


1 > 


10" 7 


Sucrose gradient sed'n 


80 






Fluorescence tag 


♦76 


6 > 


10 M 


Kinetics, comp'n 


221 



Trypsin:pancreatic trypsin inhibitor 

* Abbreviations: PDE, phosphodiesterase; TaGTP^S, a subunit of transducin complexed with GTP7S; TaGDP, a subunit of transducin complexed with GDP; CAI 
cAMP, catabolite gene activator protein complexed with cAMP; RNA polh, RNA polymerase holoenzyme; PDGF, platelet-derived growth factor; VAMP2, vesicle 
associated membrane protein 2; PKA-C, catalytic subunit of protein kinase A; PKA-R, regulatory subunit of protein kinase A. 

b Abbreviations: fl. an., fluorescence anisotropy; int. fl., intrinsic fluorescence; l.z. gf, large zone equilibrium gel filtration; eq gf., equilibrium gel filtration; SPR 
surface plasmon resonance; exch, exchange; ppt'n, precipitation; sed'n, sedimentation; comp'n, competition. 



suming a radius of 1.5 fxm and one molecule per ceil), and for 
an animal cell with a radius of 10 fxm, the lower limit is about 
0.3 pM. Thus, for two such proteins to interact a significant 
percentage of the time, the dissociation constant must be at the 
same concentration (in which case they will interact 38% of the 
time). At the other extreme, some glycolytic proteins represent 
1% or more of the soluble protein in the cell. Indeed, glycer- 
aIdehyde-3-phosphate dehydrogenase is reported to approach 
20% of the soluble protein in S. cercvisiae under certain con- 
ditions. This upper limit corresponds to 1.7 X 10 7 protein 
molecules per cell and a cellular concentration of 1 mM, and it 
represents the upper limit for binding-constant considerations 
of two such proteins. In considering protein concentrations, it 
is worth noting that a typical yeast cell contains about 3 > 10 5 
ribosomes (226), 100 to 500 molecules of tRNA splicing en- 
zymes (169, 178), and 300,000 molecules of actin (157), 

Methods for Determining Binding Constants 

A number of methods have been described to measure bind- 
ing constants. Some of the more commonly used ones are 
described below, together with a brief evaluation of the 
method. The values of dissociation constants for several pro- 
tein-protein interactions are listed in Table 1. 

Binding to immobilized proteins. Protein affinity chroma- 
tography can be used to estimate the binding constant. This 
method is well described in an excellent review (69). The form 
of the binding equation that is used in this sort of experiment 
expresses the fraction of L bound to protein P as follows: 
[PL]/[L t ] = [P f ]/([P f ] * K d ). As long as the concentration of 
covalentiy bound protein [P t ] is in great excess over that of the 
ltgand, [P t ] ^ [P f J and the fraction of protein L that is bound 
is [V t y{K d + [P t ]). Thus, if [P t ] = 100 K d , essentially all of L is 



bound (a little more than 99%), and if [P t ] = 0.01 K dt very little 
of L is bound (a little less than 1%). 

Columns are prepared with different concentrations of co- 
valentiy bound protein. Then a preparation of the interacting 
protein ligand is loaded on the column and washed with 10 col- 
umn volumes of buffer, and bound protein is eluted with SDS. At 
a concentration of 20 K d , the covalentiy bound protein retains 
95% of the ligand in one column volume and therefore 0.95 10 or 
61% in 10 column volumes. Thus, the lowest concentration of 
bound protein that allows retention of most of the ligand is 20 K d . 

The percentage of bound ligand drops very quickly as the 
concentration of covalentiy bound P on the column is lowered, 
particularly as the concentration of P t approaches K d . At 5 K d 
16% of the ligand would be retained, at 2 K d L7% of the 
protein would be retained, and at 1 K d only 0.1% would be 
retained. It is for this reason that detection of interacting 
proteins by affinity chromatography depends critically on the 
concentration rather than the amount of bound protein (see 
the section on protein affinity chromatography, above). 

An important parameter in this experiment is the amount of 
protein that is active on the column. Estimates range from 10% 
for gene 32 protein to about 50% for others (69). A second 
factor is the amount of pure protein available to be coupled. If 
protein is limiting, sufficiently high concentrations of bound 
protein on the gel are achieved only with appropriate micro- 
columns. Such columns, with as little as 20 \x\ of appropriate 
beads, are described in detail by Formosa et al. (69). With the 
recent widespread use of gene fusion technology, large quan- 
tities of protein are not a serious problem with most cloned 
structural genes. A third factor, which is evident from the 
discussion above, is the form of the protein that is used for the 
determination. Proteins that require modification to be active 
must be purified in that form for proper evaluation. 
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This method works well in estimating the binding constant. 
However, it is not clear that the values obtained represent a 
true equilibrium constant; if so. one would have to assume ha 
the bound ligand is always in equilibrium w.lh the solu n 
ligand during How of the column and that interactions of solid- 
phase bound protein with liquid-phase ligand are the same as 
interactions in the liquid state. Nonetheless, for interact 
that have been measured by more than one method, the results 
agree well (see reference 69 and references therein). 

Sedimentation through gradients. The method of sedimen- 
tation through gradients measures populattons of complexes 
by monitoring the rate of sedimentation of a mixture of pro- 
teins through gradients of glycerol or sucrose. Fractions are 
assayed by appropriate methods (activity, immunoblott.ng, 
II ) io determine the elution positions of each protein. Pro- 
teins will sediment as a complex at concentrates above he 
binding constant (provided that the complex is stable; sec the 
° vr . linns at concentra- 

discussion below) ana ai tueir uaiiTw \>^> - - 

tions below the binding constant By varying the concentration 
of one or both of the proteins and taking into account the 
dilution of the species during sedimentation one can reason- 
ably accurately bracket the binding constant. For example the 
"j r r— i ' v-i-n .^i.in inH nhnsfimal protein 
binding constant ot t. con Nu.sLS p.u..., a..~ i 

S10 was estimated at 11T 7 M based on the observation that S10 
protein scdimcnted faster (with NusB protein) when both were 
at 6 »• 10' 7 M, slightly more slowly when both were at 3 
10- 7 M and much more slowly (midway between its sedimen- 
tation position alone and its fully complexed sedimentation 
pS>n) when both were a, 1.5 x 10 " 7 M (138). There are two 
reasons that S10 sedimentcd at an intermcd.ate ^ l ^ n 
than at its own position during the run at 1.5 * 10 M of each 
protein. First, the proteins arc usually about fivefold more 
dilute at the end of the sedimentation than when they are firs 
loaded on the gradient, therefore if S 10 protein could bind a 
the beginning of the run (and sediment faster), ,t rn ght not 
bind at the more dilute concentration at the end of the run. 
Thus it would sediment at an intermediate position. Second, 
equilibrium binding is a dynamic process and molecules are 
constantly associating and dissociating. Therefore, an individ- 
ual S10 molecule which dissociated from NusB at the tra, ling 
edge of the peak would be in a region with very much less 
NusB to bind. It would sediment at its native rate from that 

PO The°e are two problems associated with this technique. First, 
it is not an equilibrium determination, because of the changing 
conditions during the run. Therefore, failure = to detect an in- 
teraction may be due to rapid equilibrium rather than a lack of 
interaction. As such, values obtained from this type of exper- 
iment represent an upper bound for the binding constant. 
Second, sedimentation through gradients does not ^solve spe- 
cies that well. Sedimentation rates vary as M - for spherical 
molecules. Thus, dimerization of one spherical molecule with 
one that is 1/10 the mass will increase its sedimentation rate by 
only 6%, which is very difficult to detect; in contrast, the 
change in mobility of the smaller molecule will be fivefold 
under such conditions. 

Although this method has limitations, it has been uselul tor 
estimating the upper limit of a binding interaction. 

Gel filtration columns. Gel filtration is another simple way 
of estimating the binding constant. In gel filtration, the c utio 
position of a protein or of a protein complex depends on s 
Stokes radius .This provides a very powerful and conceptually 
Simple method for evaluating the strength o the '"tcn.ct.on 
between two different proteins. Such sizing columns hav be 
used in three distinct ways to measure or to estimate the 
bindine constant. 



(,, Noncuuilibrium "sn.all-/one" gel filtration columns In 
the simplest approach, a solution containing a protein and a 
Imand protein is applied in a small volume to the column and 
the material is resolved in the usual way. T his is called a 
"small-zone" column. The elution positions of the prote.n and 
li'nand in the mixture are compared with those of the protein 
and ligand when each is chromatographed individually on the 
same column. If a complex has formed between the protein 
and ligand, the complex will elute earlier than either protein 
alone From measurements of the concentrations of species 
required to form a complex, one can estimate the binding 
constant. This type of experiment has been used, for example, 
to measure the binding of E. coli NusA protein to core RNA 
polymerase and has yielded values very similar to those deter- 
mined by fluorescence measurements (76). Similarly, Herberg 
and Taylor (89) quantitated the interaction of cAMP-depen- 
dent protein kinase with both the Rl subunit and PKI in the 
presence and absence of MgATP. 

This direct-application method is not a,, cqui.ionum 
m ...>, 0 -i Sinr-c the concentrations of species change during gel 
filtration (by diffusion and by dilution), the results are subject 
to the same sources of error as those of sedimentation through 
sucrose gradients (see. references 2 and 250 for a discussion). 
Thus the binding constants calculated in tins way can be vast .y 
underestimated, particularly if the complex is in rapid equilib- 
rium (sec Fig. 3 of Gegner and Dahlquist 72]) for a v.v.d 
contrast between nonequilibrium and equilibrium gel filtra- 
tion). However, several modeling systems have been described 
(see reference 211 and references therein). 

(ii) Hummel-Dreyer method of equilibrium gel filtration. 
Gel filtration can also be used as an equilibrium method to 
establish the binding constant between a protein and its ligand 
protein. One such method is based on the classic paper by 
Hummel and Ureyer (102). In this gel filtration method, both 
the gel filtration buffer and the sample had ligand at the same 
concentration, but only the sample contained prote.n. Elution 
of a protein through such a column caused an increase m the 
concentration of ligand where the prote.n eluted, followed by 
•i trough of ligand concentration representing ligand that had 
been removed in the binding. Evaluation of the binding con- 
stant of the protein-ligand complex was simply a matter ot 
knowing the concentration of protein eluted, the free concen- 
tration of ligand (set by the column), and the concentration of 
ligand bound with protein (the concentration of ligand in sam- 
nles containing protein). . . 

P This eleganf method has been applied to the interaction of 
two proteins in only a few cases. As illustrated in Fig. 10, the 
gel filtration buffer contains protein ligand, and the applied 
fample contains gel filtration buffer (with the same concentra- 
■ ,n of protein ligand) as well as the °thcr Prote.m Gcgner and 
Dahlquist (72) used a column equilibrated with CheW to dem- 
onstrate and quantitate the interaction of CheA with CheW. 
Thev varied the CheW concentration in the initial sample 
(while maintaining a constant concentration of CheA in the 
sample and CheW in the buffer) and quantitated the peak area 
at the CheW position. The CheW concentration in the sample 
at which there was no resulting CheW peak or trough repre- 
sented a sample at true equilibrium. From this, they could 
calculate a dissociation constant of the complex of 3 p.M A 
similar series of experiments was done by Yong et al. (243) to 
demonstrate an interaction between glycerol-3-phosphate de- 
hydrogenase and lactate dehydrogenase over an extremely lim- 
ited range of NADU concentrations. Such a complex was ob- 
served oniv when the NADH concentration was high enough 
for an interaction and low enough to be shared by the two 
enzvmes and it provided evidence for substrate channeling. 
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FIG 10 Equilibrium gel filtration. A solution containing both protein ligand (solid circles) and interacting protein (open circles) is applied to a gel filtration column 
which is equilibrated with solution containing the inter; cling protein and developed with running buffer containing the interacting protein. The elution pattern is shown 
in the first row of test tubes at the bottom. The second row of test tubes indicates the elution pattern that would be observed in the absence of interacting protein. 



This method is so simple and inexpensive that it is likely to 
become much more widely used than at present. Moreover, as 
an equilibrium experiment, it is without any flaw. The only 
requirements of this technique are that sufficient protein is 
available for the experiments and that the elution position of 
the complex differs from that of at least one of the interacting 
proteins. With the development of rapid techniques for large- 
scale protein purification through the use of fusion proteins, it 
should become relatively routine to obtain enough of any pro- 
tein to use as a column eluant. 

Another variation of Hummel-Dreyer columns is the parti- 
tioning method. In this technique, a protein and its ligand 
protein are mixed with a gel and allowed to equilibrate and the 
gel is centrifuged or filtered to separate the aqueous phase. 
From an analysis of the distribution of the protein and the 
ligand protein in the filtrate and in the gel when they are added 
separately or together, the K d can be calculated. An example of 
this technique is the demonstration of a complex between 
transaminase and glutamate dehydrogenase which occurs with 
a dissociation constant of 16 to 67 fiM, depending on the 
presence of various metabolites (63); this is another example of 
metabolite channeling. This method is also not in wide use, 
although it seems simple and accurate. 

(iii) Large-zone equilibrium gel filtration. One final method 
of equilibrium gel filtration is the large-zone method (1, 2), in 
which a very large sample volume is applied to the column, 
followed by conventional buffer elution. Because a large vol- 
ume is applied, the concentration of the eluted protein is fixed 
and constant during the experiment, except at the leading and 
trailing edges. ITie elution position of the leading or trailing 
edge (which measures the size of the molecule) is then moni- 
tored as a function of the sample concentration applied to the 
column. From such experiments, calculation of the dissociation 
constant is thcrmodynamically rigorous, as it is for the Hum- 



mel-Dreyer method. This large-zone method has been used to 
monitor self-association of proteins as well as interactions of 
dissimilar subunits (see, for example, references 75 and 122), 
but it has received only limited attention because of the large 
amounts of protein needed to do the experiments. 

A variation of this method, first described by Sauer (193), 
monitors the change in elution position of radiolabeled protein 
mixed with different concentrations of unlabeled protein in 
different runs. The use of labeled protein allows simpler and 
more accurate determination of the elution position, thus al- 
lowing Sauer to determine a dimerization constant of 20 nM 
for repressor. Improvements in protein labeling have demon- 
strated that the lower limit of detection for this method is a K d 
of the order of 10 12 M (13). 

Sedimentation equilibrium. Although sedimentation equi- 
librium is a classical method of determining the molecular 
weight of a protein, it has not been widely used to study 
protein-protein interactions. However, recent progress makes 
this method much more accessible on a day-to-day basis (see 
reference 185 for a recent review). Sedimentation equilibrium 
can now be done in everyday preparative ultracentrifuges with 
swinging-bucket rotors, and samples can be readily collected 
because of the development of a highly reproducible BRAN- 
DEL microfraction collector (183). These developments allow 
the use of a variety of techniques to assay the protein content 
of each sample, including kinetic assays, radioactive tracers 
(183), and gel analysis of samples (47); the result is a huge 
increase in sensitivity over that obtained with the old model E 
centrifuge (184). 

Fluorescence methods. Since fluorescence is a highly sensi- 
tive method for detecting proteins through their tryptophan 
residues, it is potentially a useful way of evaluating protein- 
protein interactions. Two such methods have been used and 
are described below. 
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(i) Fluorescence spectrum. Changes in the fluorescence 
emission spectrum on complex formation can occur either by a 
shift in the wavelength of maximum fluorescence emission or 
by a shift in fluorescence intensity caused by the mixing ot two 
proteins. Therefore, the fluorescence intensity at a particular 
wavelength can be used to evaluate the dissociation constant. 
A good example of this technique is illustrated by the interac- 
tion of the 7 subunit of cGMP phosphodiesterase (PDE^) 
subunit with the transducin a subunit (Tor) in the presence of 
GTP-yS or GDP (164). 

An equimolar solution of T01GTP7S and PDE7 causes a 
blue shift in the fluorescence emission spectrum relative to the 
sum of the individual fluorescence spectra, resulting in a dif- 
ference spectrum [F (complex) - F (sum)] with a positive 
component at low wavelengths (320 nm) and a negative com- 
ponent at higher wavelengths (357 nm). Titration of PDE-y into 
3 solution of TaGTPwS therefore caused an enhanced increase 
in the fluorescence at 320 nm relative to that observed by 
titration of PDE-y into buffer alone (and a corresponding de- 
crease at 357 nm) until the TCXGTP7S was all complexed, after 
which further addition PDE7 caused no changes in fluores- 
cence intensity relative to that observed in buffer alone. When 
corrected for PDE7 fluorescence, both curves yielded the same 
binding curve, and the K d for the interaction was evaluated at 
<100 pM. The interaction of TotGDP with PDE7 results in a 
large increase (ca. 70%) in the intensity of the fluorescence 
emission spectrum relative to the sum of the individual spectra, 
and this was used to evaluate the K d at 2.75 nM. 

This technique has two limitations. First, the probability of 
detecting a change in the fluorescence spectrum decreases with 
the total number of tryptophan residues in the two proteins, 
since the fluorescence spectrum is the sum of the contributions 
from all the tryptophan residues. Since PDE7 has only one 
tryptophan residue and Ta has two, this condition was easily 
met in studying the Ta-PDEv complex. Second, the sensitivity 
is limited by the intensity of the fluorescence change, which m 
turn depends on the inherent sensitivity of fluorescence (of the 
order of nanomolar) and the change that is observed (which is 
not easily predictable). Thus, the binding constant was too low 
to evaluate the TaGTP^S-PDE-y interaction (<100 pM) but 
was high enough to evaluate the interaction in the presence of 
GDP (2.75 nM). 

Although these two limitations exclude the study of many 
interactions, a number of proteins have a small or limited 
number of tryptophan residues. For example, bovine Hsc70 
has only two tryptophans, and its interaction with small pep- 
tides has been evaluated because of the resulting quenching of 
the fluorescence intensity (123). Similarly, the interaction of 
angiogenin (one tryptophan) with human placental RNasc in- 
hibitor (six tryptophan residues) causes a 50% increase in 
fluorescence (126), and the dissociation of mitochondrial cre- 
atine kinase (four tryptophans per monomer) from octamers 
to dimcrs results in a 25% decrease in fluorescence (81). 

A second way in which fluorescence is used to measure the 
interaction of proteins is with a fluorescent tag. This allows for 
greater sensitivity of monitoring interactions, as long as the 
fluorescent adducts do not adversely affect the function of the 
modified protein or its interaction with other proteins. An 
example of this approach is the interaction of spinach calm- 
odulin with smooth myosin light-chain kinase (146). Calmodu- 
lin from spinach has a single cysteine, which could be quanti- 
tatively labeled with 2-(4-maleimidoanilino)-naphthalcne-6- 
s. Ifonic acid (MIANS). Calmodulin labeled with MIANS was 
as efficient as the wild type in activating calcineurin, in activat- 
ing cGMP-dependent phosphodiesterase, and in binding ter- 
bium. The fluorescence of MI ANS-labeled calmodulin in- 



creased 80 : /r on binding calcineurin, more than fourfold when 
bound with myosin light-chain kinase, and twofold on binding 
caldesmon In each case, the fluorescence change required the 
presence of calcium, and titrations were done to measure the 
K d (<5, 9> nnd 250 nM, respectively). 

(ii) Fluorescence polarization or anisotropy with tagged 
molecules. Because of the long lifetimes of excited fluorescent 
molecules (nanoseconds), fluorescence can also be used to 
monitor the rotational motion of molecules, which occurs on 
this timescale. This is accomplished experimentally by the use 
of plane-polarized light for excitation, followed by measure- 
ment of the emission at parallel and perpendicular planes. 
Since rotational correlation times depend on the size of the 
molecule (approximately 1 ns/2,400 Da for an idealized mole- 
cule), this method can be used to measure the affinity of two 
proteins for one another because of the increased rotational 
correlation time of the complex. Fluorescence anisotropy is 
done most often with a protein bearing a covalently added 
fluorescent group, which increases both the observed fluores- 
cence lifetime of the excited state and the intensity of the 
fluorescent signal. 

A ^ood example of this technique is described by Weiel and 
Hershey (229), who studied the interaction of protein synthesis 
initiation factor 3 (IF3) with 30S ribosomal subunits by using 
fluorescein-Iabeled IF3. The labeled protein routinely had 
about one dye molecule per monomer, and most of the IF3 
protein had one or two dye molecules attached. Fluorescein- 
Iabeled IF3 was biologically functional: it bound 30S ribosomal 
subunits, as measured by sucrose density gradients, at a satu- 
rable site(s) and had 80 to 100% of the activity of the native 
protein in stimulating binding of tRN A McI to 70S ribosomcs in 
the presence of RNA. In the presence of 30S ribosomes, both 
the fluorescence emission spectrum and the fluorescence life- 
time of the fluorescein-Iabeled IF3 were unchanged. Thus, the 
observed increase in fluorescence polarization which was as- 
sociated with binding of 30S ribosomes was most consistent 
with the expected change in polarization as a result of binding 
a larger molecule. The Scatchard plot derived from the polar- 
ization data gave a stoichiometry of 1:1, and the dissociation 
constant trom the polarization data was 3.2 < 10~ s M. More- 
over, wild-type nonderivatized IF3 competed for the binding 
site with the same binding constant. Thus, the fluorescent 
probe had no effect on any measurable parameter and the 
measured K d is likely to be accurate. 

Similar experiments have been "done with a variety of sys- 
tems to evaluate the strength of protein-protein interactions. 
Fluorescein-Iabeled IF2 was slightly less active than nonderi- 
vatized protein, and the binding to 30S ribosomes was twofold 
weaker than that of the corresponding unlabeled protein (230). 
T7 gene 2.5 protein labeled with near-molar amounts of fluo- 
rescein isothiocyanate caused both a decrease in fluorescence 
and an increase in anisotropy when bound with T7 DNA poly- 
merase. The fluorescein isothiocyanate-modified protein had 
no effect on activity, and the binding constant determined by 
anisotropy (1 p-M) was nearly the same as that determined by 
anisotropy measurements of EDANS-labeled gene 2.5 protein 
(1.3 p-M), for which the rotational correlation time indicated a 
1:1 complex (115). The interaction of (florescein-labeled) ci- 
trate synthase and malate dehydrogenase was shown to be we'll 
within the physiological range (K d = 1 p,M) and varied as much 
as 25-fold in the presence of different metabolites (214). The 
tetramer-dimer equilibrium of X repressor could be observed 
with dansylated X repressor, because of its long fluorescence 
lifetime and high anisotropic value (indicating rigid orienta- 
tion), but not with fluorescein, which was attached in the highly 
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mobile N-tcrminal arm of the repressor molecule (and there- 
fore gave low values) (9). 

A variation of this technique lias been developed tor the 
interaction of a DNA-bmding protein with another protein, in 
which the DNA is fluorcsccntly labeled (91 ). In this way, roll 
CAP could be shown to interact with RNA polymerase hoioen- 
zyme in the presence of cAMP and in the absence of a pro- 
moter site. The fiuorescently labeled DNA oligonucleotide had 
a CAP-binding site but no RNA polymerase-binding site, and 
the resulting increase in polarization allowed the determina- 
tion of a CAP-RNA polymerase binding constant (2.8 < 10 
M). Since this interaction was not observed with a CAP mutant 
protein that was defective in transcription activation, it seems 
likely that the interaction is important physiologically. Other 
fluorescent polarization experiments suggest that the CAP- 
RNA polymerase interaction is much stronger in the presence 
of c.AMP and requires cr factor (170). 

Solution equilibrium measured with immobilized binding 
protein. A simple technique tor measuring the dissociation 
constant of a solution of interacting proteins makes use of 
bound competitor protein to determine the amount of free 
protein in such * solution. This method was first described for 
antibody-antigen reactions (71) and later modified for general 
use to determine the interaction of C4b-binding protein 
(C4BP) with human protein S (HPS) (158). A solution con- 
taining C4BP and HPS was incubated until equilibrium was 
reached. The amount of free C4BP in the solution was then 
determined by incubating an aliquot on a plate containing 
immobilized HPS under conditions (short incubation time) in 
which a limited amount of the free C4BP binds the immobi- 
lized HPS. This resulted in little perturbation of the equilib- 
rium during the assay for C4BP retained by the immobilized 
HPS, which was quantitated by an antibody-based method. 

This method requires satisfaction of three criteria. First, the 
two proteins (HPS in solution and HPS immobilized on the 
plate) cannot bind each other. If they did, C4BP could be 
captured through HPS-HPS interactions. Second, HPS in so- 
lution and HPS immobilized on the plate must compete for the 
same binding site. This is obviously true in this case, but it is 
not necessarily true if, for example, anti-C4BP is used in the 
immobilized system to detect the amount ot free C4BP. Third, 
the method requires that only free C4BP be measured during 
the incubation with immobilized HPS. This in turn requires 
that binding to the immobilized HPS remove only a small 
portion of the total C4BP (<10% was removed in this exam- 
ple) so that equilibrium of the solution is perturbed as little as 
possible. This condition also requires that the off rate of the 
complex is low compared with the time of incubation with the 
immobilized HPS; otherwise, HPS-C4BP complexes could dis- 
sociate during the incubation with immobilized HPS and the 
dissociated C4BP would be measured as free C4BP. Thus, this 
method, although simple, provides only an upper bound of the 
dissociation constant. 

Surface plasmon resonance. The recent development of a 
machine to monitor protein-protein and ligand-receptor inter- 
actions by using changes in surface plasmon resonance mea- 
sured in real time spells the beginning of a minor revolution in 
biology. This method measures complex formation by moni- 
toring changes m the resonance angle of light impinging on a 
gold surface as a result of changes in the refractive index of the 
surface up to 300 nrn away. A ligand of interest (peptide or 
protein in this case) is immobilized on a dextran polymer, and 
a solution of interacting protein is flowed through a cell, one 
wall of which is composed of this polymer. Protein that inter- 
acts with the immobilized ligand is retained on the polymer 
surface, which alters the resonance angle of impinging light as 
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a result ol the change in retractive index brought about by 
increased amounts of protein near the polymer. Since all pro- 
teins have the same refractive index and since there is a linear 
con elation between resonance angle sin is. am i protein concen- 
tration near the surface, ttiis allows one to measure changes in 
protein concentration at the surface due to protein-protein or 
protein-peptide binding. Furthermore, this can be done in real 
time, allowing direct measurement of both the on rate and the 
off rate of complex formation A good layman's review of 
surface plasmon resonance is found in articles by Malmqvist 
(136) and Jonsson et al. (109), and a clear derivation of the 
appropriate equations is found in the article by Karlsson et ah 
(111). 

In practice, determination of a binding constant requires 
measurement of two parameters. First, the increase in RU 
(resonance units) is measured as a function of time by passing 
a solution of interacting protein past the immobilized ligand 
until (usually) the RU values stabilize Second, the decrease in 
RU is measured as a function of lime with butler lacking 
interacting protein. This produces a sensorgram for each con- 
centration of protein, a continuous recording of RU versus 
time. This procedure is then repeated at a number of protein 
concentrations, "after regeneration ot the dextran surface. 
From these two sets of data, two lines are constructed whose 
slopes correspond to k tl (the on rate) and k tt (the off rate): from 
these data, K tt is calculated as kjk a . An alternative determi- 
nation of K tl can be made by using the steady-state RU values 
at different protein concentrations. 

This system has several advantages First, it requires very 
little material. Typically only 1 to 10 u.g of protein has to be 
immobilized on a sensor chip, which can be reused up to 50 
times after removal of adhering protein. Similarly, solutions of 
interacting protein are in the range of 0.01 to 1 mi, depending 
on the chosen flow rate (109). Second, the method is very fast. 
A typical run for a given protein takes about 10 min. Third, no 
modifications of the proteins arc required, such as labeling or 
fluorescent tags. Fourth, interactions can be observed even in 
complex mixtures. Fifth, both the on rate and the off rate arc 
readily obtained. Sixth, the system is useful over a wide range 
of protein concentrations. The practical lower limit of the 
original Biacore system is a change in resonance angle of 10" 
degrees (10 RU), corresponding to surface concentrations of 
10 pg/mm 2 ; moreover, the system is linear up to RU values of 
30,000 (109). Seventh, the system is quite sensitive; the prac- 
tical limit for association rates is 10 6 /M/s, and off rates as low 
as 1.1 >. 10 " 5 /s have been measured bv recording for 6 h with 
buffer (197). 

This technique has been used successfully to monitor pro- 
tein-peptide interactions. A good example is the determination 
of the binding interaction of different SH2 domains with two 
tyrosine-phosphorylated substrate peptides derived from plate- 
let-derived growth factor (166). The corresponding peptides 
were attached to the dextran polymer chip via avidin on the 
chip and biotin on the peptides. Subsequent real-time analysis 
demonstrated that interaction of these peptides with the p85 
subunit of phosphatidylinositoI-3-kinase (PI3K) was character- 
ized by a verv high association rate (2 < 10 6 /M/s) and disso- 
ciation rate (6.1/s) for the 12-mer peptide Y740P and that most 
oi this binding was contributed by the C-terminal subunit of 
p85. In this particular case, the dissociation rate of bound p85 
had to be determined in the presence of a sink of excess 
competing peptide m the buffer; otherwise, rebinding of dis- 
sociated p8.s was a significant problem because of the very high 
on rate. A similar study of p85 SH2 domain interactions with 
different tyrosine-phosphorylated peptides (from IRS- 1 ) led to 
the same conclusions of a high on rate and off rate, which was 




also measured in the presence of a sink of peptide (64). In tins 
case, the on rate was too high to measure directly (as high as 
4.4 X l() H /M/s for the C-terminal SH2 domain of p<S5) and was 
instead inferred from steady-state binding and ofi rate mea- 
surements and confirmed by competition experiments w'Uh 
free phosphorylated peptide (64). On rates in excess of IQ'VM/s 
can be limited by mass transport rates (fluid How through the 
cell) rather than binding-reaction rates, although this can be 
partially compensated by either higher flow rates or a smaller 
amount of peptide on the chip (111). Competition experiments 
were also used to show that the affinity of p85 for phosphory- 
lated peptides was 300- to 800-fold greater than for the corre- 
sponding nonphosphorylated peptide and was as much as 100- 
fold weaker with a glycine or arginine at the +1 position 
relative to the tyrosine compared with bulky hydrophobic 
groups or glutamate (64). 

One final study demonstrated that a specific threonine res- 
idue in the SH2 domain of Src, when changed to a tryptophan, 
increased the affinity of the domain for phosphorylated pep- 
tides which were substrates for GRB2 and that the correspond- 
ing tryptophan of GRB2, when altered to threonine, weakened 
the affinity of GRB2 for this peptide (137). In each of these 
three examples, the primary determinant of specificity was the 
on rate rather than the off rate. 

Surface plasmon resonance has also been used with great 
success to monitor protein-protein interactions. One such ex- 
ample is the demonstration of a quaternary complex of CheY 
with CheA, CheW, and Tar (197). CheY was bound to the 
dextran surface through a unique (and engineered) cysteine 
residue, which did not affect chemotaxis activity and which was 
remote from the interaction domain (197). CheA binds this 
immobilized CheY protein with a low association rate (368/ 
M/s) and a very' low off rate (1.14 X 10" 5 /s). Moreover, CheA, 
CheW, and Tar probably form a quaternary' complex with 
CheY; addition of ail three proteins greatly increases the 
amount of protein bound to CheY relative to that obtained 
with CheA alone, although neither Tar nor CheW binds CheY 
individually or when present together. 

Other examples of protein-protein interactions studied by 
surface plasmon resonance include the interaction of mono- 
clonal antibodies with human immunodeficiency virus type 1 
core protein p24 (111), EGF with the EGF receptor (249), the 
regulatory and catalytic domains of cAMP-dependent protein 
kinase (88), and VAMP2 and sytaxin 1A (27). 

Two minor problems are associated with surface plasmon 
resonance measurements. First, immobilization of the ligand 
protein must be of such a nature that it does not impede or 
artificially enhance interactions. This is the same problem that 
is associated with protein affinity columns. Attachment of 
CheY was accomplished by using a single site remote from the 
interaction domain (197); this presents the interacting face to 
the solvent. Phosphorylated peptides were attached by bioti- 
nylation of the peptide at a single site (but variable position) 
with a long spacer followed by noncovalent interaction with an 
avidin-coupled sensor chip (166), and attachment of monoclo- 
nal antibodies to the chip was accomplished through noncova- 
lent binding to covalently coupled rabbit anti-mouse IGGFc 
(111). Primary amines are often linked directly to the dextran 
polymer, leading to more homogeneous presentation of sur- 
faces to the solvent but causing possible inhomogeneitics in 
interaction (88). Second, the sensor chip has to be regenerated 
under conditions which do not denature the immobilized li- 
gand protein. Protein adhering to the immobilized C subunit of 
protein kinase A was removed with cAMP (88), proteins bind- 
ing to immobilized phosphorylated peptides were removed 
with a pulse of dilute SDS (166), and CheY was regenerated 
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with a pulse of guanidine hydrochloride (197). In some case', 
the ligand is deliberately removed before the next experiment, 
thus, monoclonal antibodies sticking to IGGFc were removed 
with dilute HC1 before readdition of the monoclonal antibod- 
ies to act as a ligand for p24 binding (111). 

Limits to Detection 

Determination of the binding constant of tightly interacting 
species by standard methods described above depends on be- 
ing able to determine and quantitate the fraction of protein 
ligand bound at a given protein concentration that spans the 
dissociation constant. For a standard 50,000-kDa protein, the 
practical limit of silver staining is of the order of 0.2 ng or 20 
u.1 of a 10-ng/ml solution, which would be useful for a dissoci- 
ation constant of 1 nM or greater. For in vitro translated 
protein, the practical limit is 1,000 Ci/mmol times the number 
of amino acid residues, or 1,000 dpm of 35 S-labeled protein per 
fmol (singly labeled); this corresponds to 10" 12 M or, with 10 
residues incoiporated, 10" 13 M; therefore, allowing for cor. 
centrations below K d , the lower limit of detection is of the 
order of 10" 12 M. 

Some protein-protein interactions are too tight (K d < 10 
M) to measure by the methods described above. For example, 
human placental RNase inhibitor (PRI) interacts very tightly 
with both angiogenin (K d = 7 < 10' lft M) (126, 126a) and 
human placental RNase (K d = 9 X 10" 16 M) (199). For the 
interaction of PRI with angiogenin, the association rate con- 
stant, k a , was measured by monitoring the change in intrinsic 
fluorescence by stopped-fiow fluorescence techniques, and the 
dissociation rate constant, k d , was measured by measuring the 
release of PRI in the presence of scavenger RNase, to which it 
binds and inhibits the activity. 

A dissociation constant of the magnitude of 7 < 10 5 " M for 
the PRI-angiogenin interactions means that the dissociation 
rate is measured in weeks! In this case, the r 1/2 for dissociation 
of the complex was 60 days (corresponding to k d = 1.3 < 
10" 7 /s). Furthermore, the overall on rate of 1.8 x 10 8 /M/s 
liters ■ mol/s is near the diffusion limit for molecules of the size 
of proteins. It is hard to imagine what selective pressure would 
require or maintain such a tight interaction. This is particularly 
true since human placental RNase and angiogenin both bind 
PRI equally tightly and are substantially different at the amino 
acid level. 

It is possible that a number of macroscopic protein-protein 
interactions operate at this level. Any protein composed of 
three or more subunits can have significant interactions among 
individual pairs of the component protein. If, for example, a 
subunit has a K d of 10" 7 M with each of two other subunits, the 
effective K d of the dissociation of that subunit from the com- 
plex is 10" 14 M (see reference 116 for a discussion of this 
point). Thus, complicated structures like the ribosome might 
effectively lock the proteins together in undissociable units. It 
is also possible that other, simpler interactions are this tight; 
the dissociation rate of the subunits of a number of proteins 
that purify as a complex tends never to be investigated. 

EXAMPLES OF WELL-CHARACTERIZED DOMAINS 

Given that a straightforward set of experiments is all that is 
required nowadays to identify two proteins that interact and to 
delineate the domains responsible for the binding, toward what 
ends does this analysis continue 0 To address this question, it is 
instructive to consider the case of some domains involved in 
protein-protein interaction that have been extensively charac- 
terized. Using a combination of numerous techniques, includ- 
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FIG. 11. Helical wheel representation of a leucine zipper. Adapled 
reference 221a with permission of the publisher. 



ing detailed structural approaches, investigators who have fo- 
cused on the analysis ofleucine zippers, SH2 domains, and 
SH3 domains have made tremendous advances in the last few- 
years. These studies have considerably extended our under- 
standing of transcriptional regulation and signal transduction. 
In the next sections, we provide a brief view of how these three 
domains function 

Leucine Zipper 

The leucine zipper is a protein-protein interaction motif in 
which there is a cyclical occurrence of leucine residues every 
seventh residue over short stretches of a protein in an a-helix. 
These leucine residues project into an adjacent leucine zipper 
repeat by interdigitating into the adjacent helix, forming a 
stable coiled-cotL This motif was first described by Landschulz 
et al. (124) in connection with a new structure within DNA- 
binding proteins that might be responsible for interactions with 
a similar motif to promote specific DNA binding by basic 
amino acid residues adjacent to the leucine zipper motif 
(hence the name bZIP). The leucine zipper model was origi- 
nally proposed on the basis of the leucine distribution and 
amino acid sequence of regions of C/EBP, Myc, Fos, Jun, and 
Gcn4. It is now known to be common to over 30 proteins (59). 
Subsequent experiments have confirmed the existence of this 
structure and have extended these observations. 

Structure. The X-ray structure of the Gcn4 leucine zipper 
region (consisting of 33 amino acids) demonstrates that the 
leucine zipper consists of two parallel coiled coils of a-hehces 
wrapped around each other and forming one-quarter of a turn 
of a left-handed supercoil (59, 161; also see reference 4). The 
dimer forms a smoothly bent cylinder about 45 A (4.5 nm) long 
and 30 A (3 nm) wide. On a helical-wheel representation of the 
a-helix (Fig. 11), the leucines occupy position d (and d' of the 
adjacent helix) and share the interior with the residues at 
position a (a'), as well as parts of residues e and g (and e' and 
g'). The packing corresponds to the "knobs into holes" model 
proposed by Crick (42), in which each interior amino acid 
residue is packed into a gap formed by four nearest neighbors 
from the opposite helix. More than 95% of the surface area 
that is buried upon dimerization is from the side chains of 
these residues. 

Stability. The leucine zipper coiled coil is stabilized because 
of three factors: the hydrophobic groups that are buried 
( leucines at position d and hydrophobic or neutral residues at 
position a); constancy of size of the internally packing residues 
at each position; and several distinct ion pairs. Three such ion 
pairs appear to form, and each is between the e of one heptad 
and the g of the other. The leucine residues are critical for 
function in Cicn4. Although each individual leucine can itself 
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be replaced by several different hydrophobic residues, random- 
ized substitution of the leucines with other hydrophobic resi- 
dues invariably causes the protein to lose function when more 
t [ ian one leucine is substituted; furthermore, isoleucinc is by 
far the most easily tolerated substitution (98). 

The binding constant of leucine zipper moieties that interact 
is estimated to be in the nanomolar range (163) and has been 
measured at 5 X 10" 8 M for the Jun-Jun dimer at 4°C (196). 
Even a peptide corresponding to the Fos -leucine zipper, which 
does not dimerize in vitro, has been shown to dimerize in the 
micromolar range (163). 

The leucine zipper moieties that naturally interact do not 
necessarily have the maximal stability. For example, the Gcn4 
dimer has a buried asparagine residue which is present within 
the hydrophobic core (59, 161). This Asn residue packs loosely 
in the crystal structure, and this position is particularly tolerant 
of other amino acids (98). Moreover, the asparagine residue 
(and resultant internal hydrogen bond) drastically destabilizes 
the Gcn4 zipper; its replacement with valine stabilizes the 
coiled coil about 1,000-fold (28). It has been speculated that 
the internal asparagine of Gcn4 (and, by extension, other bur- 
ied polar groups in the a position in other leucine zippers) is 
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fore can be subject to regulation, or that it keeps the coiled 
coils in register (4). 

Specificity. The specificity of leucine zippers is the key to 
their regulatory properties. The oncoproteins Fos and Jun, for 
example, associate with each other to form a heterodimer in 
preference to the Jun-Jun homodimer. This preference has 
important consequences in that Fos-Jun heterodimers and 
Jun-Jun homodimers bend DNA in opposite orientations 
(114), which may explain the fact that Jun interaction with the 
glucocorticoid response element of the prolactin gene results 
in activation of the gene, whereas Fos-Jun interaction results in 
repression (51 ). 

Specificity of Fos-Jun and Jun-Jun dimerization is achieved 
primarily by the electrostatic interactions of residues at the e 
and g positions at the periphery of the hydrophobic core (162). 
Fos has Glu residues at the g position, and Fos-Fos dimers are 
much more stable (as measured by T m ) at pH values at which 
these Glu residues are neutralized. Conversely, Jun is slightly 
more basic at the e and g positions, and Jun-Jun dimers are 
more stable at higher pH. Fos-Jun dimers, which are the pref- 
erential form, are uniformly stable over a wide range of pH 
values, because they are more neutral overall. A series of 
hybrid peptides in an otherwise Gcn4 peptide illustrate the 
point (162). Specificity (or antispecificity) is achieved by the 8 
amino acids at the e and g positions of the peptide and not at 
other positions 

Regulation. Leucine zipper proteins are likely to be func- 
tionally regulated. Thus, the carboxyl-terminal zipper of the 
human and Drosophila heat shock factors may suppress forma- 
tion of amino-terminal zippers in a way that is sensitive to heat 
shock (175). Similarly, the calphotin protein binds calcium at 
one end and has a distinctive leucine zipper at the other end 
(8). It may therefore be used to transmit signals by altering 
binding properties. 

SH2 Domain 

The SH2 domain was first recognized as a noncatalytic do- 
main of Src that was homologous to the Fps protein (189) and 
is now recognized as a common motif involved in protein- 
protein interactions (117, 168). More than 20 SH2-containing 
proteins have been identified. They share a motif of about 100 
amino acids that is involved in the recognition of proteins and 
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pcp.KU-.OHHann.nw.ho.phoiylatccl.VH^ncs 1 his .co.jm.iik >» 
is unpluated m the mechanism of signal trnn: t duct,on. because 
the phosphorvlatecl tyrosines that are re.-oj.ni/.ed mcli.de thov: 
„f «mw.l. factor receptors such as the platelet-derived growth 
facun .eceptor, the Id reeeplor, and the fibroblast growth 
factor receptor. On binding then respective growt.i ..ic.or,, t.K 
growih factor receptors have their tyrosine kinase activity ac- 
tivated which allows them to autophosphorylate. 1 he auto- 
phosphorylated receptor then binds various proteins contain- 
ing SH2 doma.ns, which are then phosphorylatcd to modula e 
their activity. Thus, the binding of growth factor on the outside 
of the cell results in phosphorylation on the ins.de of specific 
substrate proteins. The particular proteins that are phosphor- 
ylatcd depend on the binding specificity of the SH2 domains 
for the phosphorated receptor. Binding of different peptides 
to different SH2 domains has yielded the following results. 
Binding of SH2 proteins requires a large domain ot the buz 
b . . .rein ,i,„,v t mc which >s rom- 

protein. The conserved comam ui oIU uu ...a...s>, w...e. ■ 

L to more than two dozen proteins, has been crystallized lor 
Src (224 22S) and solved by nuclear magnetic resonance spec- 
troseopv' techniques for e-Abl (165) and p85a of P.3K (20) In 
each case, this domain folds into a structure in which a set ol 
internal antiparallel sheets is surrounded by two more or .ess 
symmetrical a-heliccs. The conserved amino adds tend to be 
part of the recognition for phosphotyrosine (e.g.. Arg-1 /."- ot 
Src) or part of the hydrophobic pocket. Variable regions arc 
responsible for sequence recognition (205) and may be parts of 
variable loops of unknown function (188). 

Binding of SH2 proteins requires phosphorylated tyrosine in 
vitro Thus, the binding constant of a peptide to an SH_ pro- 
tein of p.XS is between 50- and 800-fold weaker without the 
phosphate than with the phosphate (64). This preference is 
attributable to specific side chain contacts of the SI I. don...... 

with the phosphor^ group of phosphotyrosine. 1 he phosphory! 
oxvuens are hvdro^-n bonded with two guanid.mum hydro- 
gens one from one argin.ne and one from another argmme, 
one hydroxy! hydrogen from threonine and one from serine, 
and a backbone amide hydrogen. One of the argin.ncs appears 
to be acting both as a hydrogen bond donor and as an ion pair 
with the phosphate group. Thus, it cannot be substituted with 
lysine without loss of binding (140). These contacts are the 
same whether a weak-affinity (224) or a strong-affinity 1-5) 
phosphotyrosine-conta.ning peptide is used. 

SH2 domains make contacts with only a small region sur- 
rounding the phosphorylatcd tyrosine. Small peptides fa, h- 
fully reproduce binding to SH2 domain, and display binding 
constants of the orde, of nanomolar (64, 21Sy 1 h-s * consis- 
tent with the crystallographic data of the SH2 domain of % -S c 
bound to a high-affinity 1 1 -amino-acid peptide; the data clearly 
show significant peptide-protein interactions a, 6 o he 
positions of the phosphopcptide, from -2 to t 3, rcUti l _ 
"he tyrosine residue (225). These are the residues tha l a c 
associated high electron density, ind.cat.ng a fixed position n 
the crystal (except lor the side chain portion of Gln-1) . in 
addition to the phosphotyrosinc-binding uUe » 
above, there are several ring interactions that define the :rc t o. 
the phosphotyrosine pocket. There .s also a very wdl-deftned 
interaction of isoleucine at +3 w,th a deep pocket in the SH 

withalvsine amino group, and Glu + 2 -PP^-^^ ^ 

bv a nearby argininc guamdmium and its assoualca H C 
cculcs The ammo auds at positions -1 and -2 appear to cap 



the phosphnivrosLiK- bmdmg through the polypeptide b.u k 
hone at position 1 and the proline ring at -2. 

Other SH2 domain proteins bind different peptides through 
interactions at the same -4 1 to +3 positions relative to the 
-^hov^hol'Tosine. 'I his has been elegantly investigated by 
Son-vang e\ 'ah (205) through a study of selectivity of binding 
of random peptides to different SH2 domains. Although the 
results obtained in this experiment represent bulk selectivity 
for certain amino acids at certain positions relative to phos- 
photyrosine, rather than selectivity of individual peptides of 
known sequence, the results are clear. Each of the three posi- 
tions following the phosphotyrosine plays an important role in 
determining the selectivity of binding in certain SH2 proteins, 
but the amino acids that are crucial and the extent to which 
they are crucial differs markedly. Thus, most of the discrimi- 
nation of the C-terminal SH2 domain of p85 is due to its 
preference for methionine at +3, whereas most of the discrim- 
ination of Nek is at positions 1 and 2, where it prefers gluta- 

, ....... /"?0M 

male and aspauaic, i^p^^^': v *- u -/- 

SH3 Domain 

Th^ SI-H domain is a second noncatalytic domain of Src 
which is involved in protein-protein interactions and which is 
part of a motif shared by other proteins, including tyrosine 
kinases, phosphol.pase C-y (PLC-7) PI3K, GTPase-activatmg 
protein, the cell proliferation proteins Crk and Grb2/Sem:>, 
-md the cytoskcletal proteins spectrin, myosin 1, and an actin- 
b.ndmo protein (see references 117, 120, 154, and 168 for a 
recent list). More than 27 proteins have been shown to have an 
Sm domain, which varies between about 55 and Is amino 
■icids and its structure has been determined from four differ- 
ent specific domains spectrin ( 154), Src (245), PI3K (120), and 
IT C (118) Each such structure is composed of antiparallel 
sheets oriented more or less at right angles to one another (or 
for PI C two partial greek kev motifs of a barrel oriented such 
that the strands on opposite sides cross almost perpendicular- 
ly) and the amino acids in the conserved strands and a con- 
served C-terminal 3 1() helix correspond to many of those that 
are conserved among SH3 proteins. In each case, a hydropho. 
b.c pocket is formed on the surface of the molecule; those of 
P 3K and Src are remarkably similar (120), and the oca on e 
the pocket is conserved between PLC and spectrin (1 18). This 
hydrophobic pocket has been implicated in peptide binding fo 
S c (245) smce binding of such a peptide perturbs the signal 
from these amino acids. There are notable differences among 
he protein structures; PLC, for example is very similar in 
econdary structure to spectrin but not to Src, leading to dif- 
ferent architectures (118). This property presumably leads to 
different binding specificities. , • , - 

The substrates to which SH-Vcontaining proteins bind in- 
clude an uncharacterizcd protein similar to G Pase-activating 
protein-rho, detected with Abl (36); mSosl and hSosl (pro- 
teins similar to Drosophila Sos. which is required for Ras sig- 
naling), detected with Grb2 (187); formin and the rat m* 
muscarinic receptor, detected with Abl (181); PBk. detected 
with v-Src (130); and P 56 ,tk and P 59 >" (172, 1 /.>). 

Like the SH2 domain, the SH3 domain binds simple pep- 
tides with a high degree of sequence specificity and a high 
affinity As judged on a qualitative basis, a 10-amino-acid pro- 
line-rich sequence is responsible for strong binding of the Abl 
SI H domain to two proteins, called 3BIM and 3BP-2 (36. 181). 
This binding is specific in two ways, l-irst, some but not ah 
sinnlc-amino-acid alterations destroy detectable binding- 1 hus_. 
™?.ines at position'- 2, 7, and 10 are important but those at .- 
■ind'to some extent 9 are not Nonproline residues do not 
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Second peptide hmdin^ is SIB domain specific. I hus, /MUM 
binds the SI 13 domains of AM and Src hut not those of Neural 
Src or Ok (m>), ami 31*1*2 binds most strongly to Ab! S 1 43 , less 
so to Src Sin and Grb2, and poorly to Nek (181). 

Similarly, binding of mSosl to Grb2 appears to be through a 
proline-rich motif at the C terminus of the protein (18/); any 
of several proline-rich 11-amino-acid peptides corresponding 
to sequences in this region all compete, and competition ap- 
pears to require a C-terminal arginine. This arginine may add 
selectivity to the binding of mSosl to Grb2. A peptide contain- 
ing the relevant arginine-containing motif binds to Grb2 
through its SH3 domain with a K <s ot 25 nM (128). 

CONCLUDING REMARKS 

Alberts and Miakc-Lyc (5), summarizing a meeting entitled 
Proteins as Machines, described Tom Pollard's flow diagram 
for the detailed analysis of a cell biology process. First, a 
complete inventor) of all the molecules making up the ma- 
chine must be made. Second, a determination must be made of 
how and in what order the molecules interact with each other. 
Third, both detailed rate constants for each transition and 
structures of each component at atomic resolution must be 
obtained. While no process is yet completely understood at the 
three levels described by Pollard, enormous progress has been 
made in deciphering protein machines. In this review, we have 
tried to convey some of the classical and more recent ap- 
proaches used to develop the inventory of proteins and the 
nature of their interactions. 

Two factors are having a large impact on how cellular pro- 
cesses are viewed. First, the vast amount of DNA sequence 
information being obtained means that the identity of almost 
all proteins, at the level of primary sequence, may soon be 
known. Complete sequences for organisms such as E. coli. 
yeast cells, and the nematodes and nearly complete compila- 
tions of the cDNA sequences for human tissues should be 
available in the next few years. Second, the range of new 
procedures now available means that hundreds to thousands of 
new protein-protein interactions may be identified in the same 
period. Ten to twenty years ago, only a few complexes of 
proteins were well characterized as to their subunit composi- 
tion and specific interactions; currently, a large number of such 
complexes are known. Relatively soon, there may be an enor- 
mous number. The continuing challenge will be for biochem- 
ists and cell, molecular, and structural biologists to use this 
information to understand how the cell works. 
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